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HUMAN TRANSPORT-ASSOCIATED MOLECULES 

TECHNICAL FIELD 

5 This invention relates to nucleic acid and amino acid sequences of human 

transport-associated molecules and to the use of these sequences in the diagnosis, 
treatment, and prevention of cancer and transport disorders. 

BACKGROUND OF THE INVENTION 

10 Eukaryotic cells are bound by a lipid bilayer membrane and subdivided into 

functionally distinct, membrane bound compartments. The membranes maintain the 
essential differences between the cytosol, the extracellular environment, and the contents 
of each intercellular organelle. As lipid membranes are highly impermeable to most polar 
molecules transport of essential nutrients, metabolic waste products, cell signaling 

15 molecules, macromolecules and proteins across lipid membranes and between organelles 
must be mediated by a variety of transport-associated molecules. 

Transport of molecules between organelles or vesicles is essential for cell function. 
Eukaryotic proteins are synthesized within the endoplasmic reticulum (ER), delivered 
from the ER to the Golgi complex for post-translational processing and sorting, and 

20 transported from the Golgi to specific intracellular and extracellular destinations. This 
intracellular and extracellular transport of protein molecules is termed vesicle trafficking. 
Trafficking is accomplished by the packaging of protein molecules into specialized 
vesicles which bud from the donor organelle membrane, are transported, and fuse to the 
target membrane. Specialized cell types utilize specific vesicle trafficking routes. For 

25 instance, in endocrine glands, hormones and other secreted proteins are delivered to 
secretory granules for exocytosis through the plasma membrane to the cell exterior. In 
macrophages, peroxidases and proteases are delivered to lysosomes. In fat and muscle 
cells, glucose transporters are stored in vesicles which fuse with the plasma membrane in 
response to insulin stimulation. 

30 Numerous proteins are necessary for the formation, targeting, and fusion of 

transport vesicles and for the proper sorting of proteins into these vesicles. The vesicle 
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trafficking machinery includes coat proteins which promote the budding of vesicles from 
donor membranes; vesicle- and target-specific identifiers (v-SNAREs and t-SNAREs) 
which bind to each other and dock the vesicle to the target membrane; and proteins which 
bind to SNARE complexes and initiate fusion of the vesicle to the target membrane 
5 (SNAPs). 

The Rab family of small GTP-binding proteins are key regulators of vesicle 
transport events. Different Rab proteins are associated with distinct subcellular 
compartments and their vesiicular carriers. GTP binding and hydrolysis of Rab proteins 
regulates the specificity and direction of vesicle transport. Over 40 Rab proteins have 

10 been identified in mammalian cells with sequences that share between 35% and 95% 
identity indicating a broad range of functional specificities. Usually, these proteins are 
associated with specific cellular organelles. For example, Rabl is localized to the ER and 
Golgi complex, Rab2 in the transitional ER and the cis Golgi network, Rab3 to secretory 
vesicles, Rab4 to early endosomes, Rab5 to early endosomes and the plasma membrane, 

15 Rab6 to medial and trans Golgi cisternae, Rab7 to late endosomes, and Rab9 to late 
endosomes and the trans Golgi network. In addition, RAB proteins are localized to 
specific tissue types. RAB 17 is found in epithelial cells which contain distinct apical, 
basolateral, and transcytotic transport pathways, while Rab3a, Rabl 5, and Rab23 are 
predominantly expressed in the brain and nervous system. (Kuge, O. et al. (1993) J. Cell. 

20 Biol. 123:1727-1734; Olkkonen, V. M. et al. (1994) Gene 138:207-21 1; Olkkonen, V. M. 
et al. (1997) Int. Rev. Cytol. 176:1-85; and Stahl, B. (1994) J. Biol. Chem. 269:24770- 
24776.) 

The Golgi apparatus lies at the heart of the vesicle transport pathway, and consists 
of a stack of cisternae, together with the cis- and trans-Golgi networks on either face of the 

25 stack. Molecules are transported through the sequence of compartments that comprise the 
Golgi apparatus by vesicles which bud from one compartment and fuse with the next. 
During mitosis, the Golgi apparatus is disassembled into hundreds of vesicles. These 
fragments can then be partitioned between the two daughter cells. The majority of mitotic 
Golgi fragments are small uniform vesicles, produced by a coatomer I-dependent 

30 mechanism (COPI) which is responsible for both anterograde and retrograde transport 
through the Golgi stack. There is also a COPI-independent pathway which produces 
larger, more heterogeneous fragments. It is hypothesized that the accumulation of Golgi 
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fragments during mitosis is related to inhibition of membrane fusion. The pi 15 protein 
has been implicated in three Golgi- trafficking steps: 1) intra-Golgi traffic; 2) endoplasmic 
reticulum to Golgi traffic; and 3) transcytosis. The pi 15 has been shown to be involved in 
a docking step prior to membrane fusion, pi 1 5 binding to Golgi membranes is 
5 specifically inhibited under mitotic conditions as a result of modifications to the 
membranes. An early stage of mitotic Golgi disassembly in vitro can be inhibited by 
increasing pi 15 receptor occupancy with excess pi 15. These data suggest that the mitotic 
inhibition of pi 15 binding contributes to the inhibition of membrane fusion by acting at 
the early step of vesicle docking. (Sapperstein, S. K. etal. (1996) J. Cell Biol. 
10 132:755-767.) 

Vesicles in the process of budding from the ER and the Golgi are covered with a 
protein coat similar to the clathrin coat of endocytotic vesicles. The protein coat is 
assembled from cytosolic precursor molecules and is confined to budding regions of the 
organelle membrane. The coat protein (COP)-coated vesicles are uncoated after budding 
15 is complete to allow fusion of the vesicle to the target membrane. 

The COP coat consists of two major components, a guanosine triphosphatase and 
coat protomer (coatomer). The individual proteins are cytosolic until assembled into the 
protein coat. Coatomer is an equimolar complex of seven proteins, termed alpha-, beta-, 
beta'-, gamma-, delta-, epsilon- and zeta-COP. The coatomer complex binds to dilysine 

20 motifs and reversibly associates with Golgi non-clatherin coated vesicles. These vesicles 
further mediate transport from the ER , via the Golgi, to the trans-Golgi network. 
Coatomer complex is required for budding from the Golgi membranes and is essential for 
retrograde Golgi to ER transport of dilysine motif containing proteins. Polyclonal 
antibody directed to zeta-COP blocks the binding of coatomer to Golgi membranes and 

25 prevents the assembly of COP-coated vesicles on Golgi cisternae. Assay of a conditional 
lethal, temperature-sensitive epsilon-COP mutant of Chinese hamster ovary (CHO) cells 
indicates that epsilon-COP and associated COP coatomers may be necessary for the 
establishment or maintenance of Golgi structure, for proper ER-to-Golgi transport of 
integral membrane and secreted proteins, and for normal endocytotic recycling of low- 

30 density lipoprotein receptors. (Kuge, O. et al. (1993) supra; and Cosson, P. et al. (1996) 
EMBOJ. 15:1792-1798). 

Transport of proteins and RNA between the nucleus and the cytoplasm occurs 
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through nuclear pore complexes (NPCs). NPC-mediated transport occurs in both 
directions through the nuclear envelope. All nuclear proteins are imported from the 
cytoplasm, their site of synthesis. Transfer RNA and messenger RNA (mRNA) are 
exported from the nucleus, their site of synthesis, to the cytoplasm, their site of function. 
5 Processing of small nuclear RNAs involves export into the cytoplasm, assembly with 
proteins and modifications such as hypermethylation to produce small nuclear ribonuclear 
proteins (snRNPs), and subsequent import of the snRNPs back into the nucleus. The 
assembly of ribosomes requires the initial import of ribosomal proteins from the 
cytoplasm, their incorporation with RNA into ribosomal subunits, and export back to the 
10 cytoplasm. (GCrlich, D. and Mattaj, I.W. (1996) Science 271:1513-1518.) 

The transport of proteins and mRNAs across the NPC is selective, dependent on 
nuclear localization signals (NLS), and generally requires association with nuclear 
transport factors. During transcription, capping, splicing, and polyadenylation, nuclear 
mRNAs associate with a large number of proteins. These associations are coordinated 
with the various stages of mRNA synthesis, processing, and subsequent transport into the 
cytoplasm. Some proteins are dissociated before nuclear mRNAs are transported across 
the NPC while others are dissociated shortly after nuclear mRNA transport across the NPC 
and are reimported into the nucleus. Examination of mRNA export mutants in S. pombe 
and £. cerevjgjae show mutations in nuclear proteins which directly associate with mRNA 
and are hypothesized to effect docking of mRNA at the NPC. (Brown, J. A. et al. (1 995) 
J. Biol. Chem. 270: 741 1-7419.) 

Defective nuclear transport may play a role in cancer. The BRCA1 protein 
contains three potential NLSs which interact with importin alpha, and is transported into 
the nucleus by the importin/NPC pathway. In breast cancer cells the BRCA1 protein is 
aberrantly localized in the cytoplasm. Wild-type BRCA1 protein expressed in six 
different breast cancer cell lines was localized in the cytoplasm. However, wild-type 
BRCA1 protein expressed in four non-breast cancer cell lines was localized in the nucleus 
suggesting that the mislocation of the BRCA1 protein in breast cancer cells may be due to 
a defect in the NPC nuclear import pathway. (Chen, C.F. et al. (1996) J. Biol. Chem. 
271:32863-32868.) 

In 10 of 27 cases of breast cancer studied, p53 tumor suppressor protein has been 
found in the cytoplasm. A majority of the p53 cDNAs derived from breast cancers with 
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cytoplasmic p53 protein were wild type. In contrast, p53 cDNAs derived from breast 
cancers with nuclear P 53 protein contained a variety of missense mutations and a nonsense 
mutation. It has been suggested that in some breast cancers, the tumor-suppressing 
activity of P 53 is inactivated by the sequestration of the protein in the cytoplasm away 
5 from its site of action in the celi nucleus. Cytoplasmic wild-type P 53 was also found in 
human cervical carcinoma cell lines. (Moll, U.M. et al. (1992) Proc. Natl. Acad. Sci. USA 
89:7262-7266; and Liang, X.H. et al. (1 993) Oncogene 8:2645-2652.) 

Transport across membranes also depends on transporters, or pumps, membrane- 
spannmg proteins which bind to specific classes of molecules and undergo a series of 
10 conformational changes in order to transfer the bound molecule across a membrane 

Transport by pumps can occur by a passive, concentration-dependant mechanism or can be 
hnked to an energy source such as ATP hydrolysis or an ion gradient. Examples include 
fachtative transporters, the secondary active symporters and antiporters driven by ion 
grad,ents, and active ATP binding cassette transporters involved in multiple-drug 
.5 resistance and targeting of antigenic peptides to MHC Class I molecules. Transported 
substrates range from nutrients and ions to a broad variety of drugs, peptides and proteins 

The energy requirements of most mammalian cells are met through a continuous 
supply of glucose which circulates in the blood. Glucose enters cells through specific 
glucose transporter molecules present in the plasma membrane. The family includes 
20 passive transporters typical of mammalian tissues and active, H(+)-iinked sugar 

transporters from bacteria. These transporters characteristically contain two groups of six 
putafve membrane-spanning alpha-helices separated by large, hydrophilic, cytoplasmic 
regions. Both the N-terminal and C-terminal regions of the sequence are also predicted to 
be cytoplasmic. Biophysical studies on the human erythrocyte glucose transporter indicate 
25 that the membrane-spanning alpha-helices associate to form a hydrophilic channel or a 
substrate-binding cleft extending across the membrane. The mechanism of substrate 
translocation involves alternate exposure of the substrate-binding site to each face of the 
membrane via a conformational change. (Pessin, J. E. and Bell G. I. (1992) Annu Rev 
Physiol 54:91 1-930.) 

30 Transporters play a major role in the regulation of pH, excretion of drugs and the 

cellular K + /Na+ balance. Monocarboxylate anion transporters are proton-coupled 
symporters with a broad substrate specificity that includes L-lactate, pyruvate, and the 
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ketone bodies acetate, acetoacetate. and beta-hydroxybutyrate. At least seven isoforms 
have been identified to date. The isoforms are predicted to have twelve transmembrane 
(TM) helical domains with a large intracellular loop between TM6 and TM7, and play a 
critical role in maintaining intracellular pH by removing the protons that are produced 
5 stoichiometrically with lactate during glycolysis. The best characterized 

H(+)-monocarboxylate transporter is that of the erythrocyte membrane, which transports 
L-lactate and a wide range of other aliphatic monocarboxylates. Other cells possess 
H(+)-linked monocarboxylate transporters with differing substrate and inhibitor 
selectivities. In particular, cardiac muscle and tumor cells have transporters that differ in 

10 their K m values for certain substrates, including stereoselectivity for L- over D-Iaetate, and 
in their sensitivity to inhibitors. There are Na(+)-monocarboxylate cotransporters on the 
luminal surface of intestinal and kidney epithelia, which allow the uptake of lactate, 
pyruvate, and ketone bodies in these tissues. In addition, there are specific and selective 
transporters for organic cations and organic anions in organs including the kidney, 

15 intestine and liver. Organic anion transporters are selective for hydrophobic, charged 
molecules with electron-attracting side groups. Organic cation transporters, such as the 
ammonium transporter, mediate the secretion of a variety of drugs and endogenous 
metabolites, and contribute to the maintainence of intercellular pH. (Poole, R. C. and 
Halestrap, A.P. (1993) Am. J. Physiol. 264:C761-C782; Price, N. T. etal. (1998) 

20 Biochem. J. 329:321-328; and Martinelle, K, and Haggstrom, I. (1993) J. Biotechnol. 30: 
339-350.) 

ATP-binding cassette (ABC) transporters are members of a superfamily of 
membrane proteins that transport small hydrophilic molecules across biological 
membranes. They are comprised of two homologous halves, each containing two parts: a 

25 transmembrane domain with multiple transmembrane segments and a nucleotide binding 
domain. Mammalian ABC transporters are found either as complete transporters, e.g. the 
multiple drug resistance transporter and the cystic fibrosis transmembrane regulator 
proteins, or as half transporters, e.g., the transporters associated with antigen processing; 
TAP1 and TAP2 proteins, which dimerize to form the active TAP transporter. Two half 

30 ABC transporters have been identified in the human peroxisome membrane: the 

adrenoleukodystrophy protein (ALDP) and the 70-kDa peroxisomal membrane protein 
(PMP70). Mutations in the adrenoleukodystrophy gene cause X-linked 

-6- 
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adrenoleukodystrophy, an inborn error of peroxisomal P-oxidation of very long chain fatty 
acids. Mutations in the PMP70 genes have been found in patients with Zellweger 
syndrome, an inborn error of peroxisome biogenesis. Multidrug resistance (MDR) results 
from overproduction of another member of the ABC transporter family, P-glycoprotein. 
5 MDR is primarily due to increased drug extrusion from the resistant cells by P- 

glycoprotein. The P-glycoproteins have 2 homologous halves, each with 6 hydrophobic 
segments adjacent to a consensus sequence for nucleotide binding. The hydrophobic 
segments may form a membrane channel, whereas the nucleotide binding site may be 
involved in energization of drug transport. (Saurin, W. et al. (1994) Mol. Microbiol. 
10 12:993-1004; Shani, N., et al. (1996) J. Biol. Chem. 271:8725-8730; and Koster, W., and 
Bohm, B. (1 992) Mol. & Gen. Genet. 232:399-407.) 

The etiology of numerous human diseases and disorders can be attributed to 
defects in the transport of proteins through membranes, to organelles or the cell surface. 
Defects in the trafficking of membrane-bound transporters and ion channels are associated 
15 with several disorders, e.g. cystic fibrosis, glucose-galactose malabsorption syndrome, 
hypercholesterolemia, von Gierke disease, and certain forms of diabetes mellitus. 
Abnormal hormonal secretion is linked to disorders, e.g., diabetes insipidus (vasopressin), 
hyper- and hypoglycemia (insulin, glucagon), Grave's disease and goiter (thyroid 
hormone), and Cushing's and Addison's diseases (adrenocorticotropic hormone). Single- 
20 gene defect diseases resulting in an inability to transport small molecules across 

membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease, 
(van't Hoff, W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G. M. et al. (1994) Ann. 
Intern. Med. 120:218-226; and Chillon, M. et al. (1995) NewEng. J. Med. 
332:1475-1480.) 

25 Cancer cells secrete excessive amounts of hormones or other biologically active 

peptides. Disorders related to excessive secretion of biologically active peptides by tumor 
cells include, e.g., fasting hypoglycemia due to increased insulin secretion from 
insulinoma-islet cell tumors; hypertension due to increased epinephrine and 
norepinephrine secreted from pheochromocytomas of the adrenal medulla and sympathetic 

30 paraganglia; and carcinoid syndmme, which includes abdominal cramps, diarrhea, and 
valvular heart disease, caused by excessive amounts of vasoactive substances, e.g., 
serotonin, bradykinin, histamine, prostaglandins, and polypeptide hormones, secreted from 
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intestinal tumors. Ectopic synthesis and secretion of biologically active peptides includes, 
e.g., ACTH and vasopressin in lung and pancreatic cancers; parathyroid hormone in lung 
and bladder cancers; calcitonin in lung and breast cancers; and thyroid-stimulating 
hormone in medullary thyroid carcinoma. (Schwartz, M.Z. (1997) Semin. Pediatr. Surg. 
5 3:141-146; and Said, S.I. and Faloona, G.R. (1975) N. Engl. J. Med. 293:155-160.) 
The discovery of new human transport-associated molecules and the 
polynucleotides encoding it satisfies a need in the art by providing new compositions 
which are useful in the diagnosis, treatment, and prevention of cancer and transport 
disorders. 

10 

SUMMARY OF THE INVENTION 

The invention features substantially purified polypeptides, human transport- 
associated molecules, referred to collectively as "TRANP" and individually as "TRANP- 
1", "TRANP-2", "TRANP-3", "TRANP-4", "TRANP-5", "TRANP-6", "TRANP-7", 

15 "TRANP-8", and "TRANP-9" In one aspect, the invention provides a substantially 
purified polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, and fragments thereof. 
The invention further provides a substantially purified variant having at least 90% 

20 amino acid identity to the amino acid sequences of SEQ ID NO: 1, SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and 
SEQ ID NO:9, or to a fragment of any of these sequences. The invention also provides an 
isolated and purified polynucleotide sequence encoding the polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, 

25 SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:8, and SEQ ID NO:9, and fragments thereof. The invention also includes an isolated 
and purified polynucleotide variant having at least 90% polynucleotide identity to the 
polynucleotide sequence encoding the polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ 

30 ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, 
and fragments thereof. 

The invention further provides an isolated and purified polynucleotide which 
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hybridizes under stringent conditions to the polynucleotide sequence encoding the 
polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, and fragments thereof, as well as 
5 an isolated and purified polynucleotide sequence which is complementary to the 

polynucleotide sequence encoding the polypeptide comprising the amino acid sequence 
selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, 
and fragments thereof 

10 The invention also provides an isolated and purified polynucleotide comprising a 

polynucleotide sequence selected from the group consisting of SEQ ID NO: 10, SEQ ID 
NO: 1 1 9 SEQ ID NO: 1 2, SEQ ID NO: 1 3 , SEQ ID NO: 1 4, SEQ ID NO: 1 5, SEQ ID NO: 1 6, 
SEQ ID NO: 1 7, and SEQ ID NO: 1 8, and fragments thereof. The invention further 
provides an isolated and purified polynucleotide variant having at least 90% 
15 polynucleotide identity to the polynucleotide sequence comprising a polynucleotide 

sequence selected from the group consisting of SEQ ID NO: 10, SEQ ID NO:l i, SEQ ID 
NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
SEQ ID NO:l 8, and fragments thereof, as well as an isolated and purified polynucleotide 
which is complementary to the polynucleotide sequence comprising a polynucleotide 
20 selected from the group consisting of SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, and 
SEQ ID NO: 18, and fragments thereof 

The invention further provides an expression vector containing at least a fragment 
of the polynucleotide encoding the polypeptide comprising an amino acid sequence 
25 selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, 
and fragments thereof. In another aspect, the expression vector is contained within a host 
cell. 

The invention also provides a method for producing a polypeptide comprising the 
30 amino acid sequence of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ 
ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or SEQ ID NO:9, or fragments 
thereof, the method comprising the steps of: (a) culturing the host ceil containing an 
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expression vector containing at least a fragment of a polynucleotide sequence encoding 
TRANP under conditions suitable for the expression of the polypeptide; and (b) recovering 
the polypeptide from the host cell culture. 

The invention also provides a pharmaceutical composition comprising a 
5 substantially purified polypeptide selected from the group consisting of the amino acid 
sequence of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or SEQ ID NO:9, and fragments thereof in 
conjunction with a suitable pharmaceutical carrier. 

The invention further includes a purified antibody which binds to a polypeptide 

10 comprising the amino acid sequence of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or SEQ ID NO:9, 
or fragments thereof, as well as a purified agonist and a purified antagonist to the 
polypeptide. The invention also provides a method for treating or 

preventing a cancer associated with increased expression or activity of TRANP, the 

15 method comprising administering to a subject in need of such treatment an effective 

amount of an antagonist of the polypeptide selected from the group consisting of SEQ ID 
NO: 1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ 
ID NO:7, SEQ ID NO:8, or SEQ ID NO:9, and fragments thereof. 

The invention also provides a method for treating or preventing a transport disorder 

20 associated with increased expression or activity of TRANP, the method comprising 

administering to a subject in need of such treatment an effective amount of an antagonist 
of the polypeptide selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:8, or SEQ ID NO:9, and fragments thereof. 

25 The invention also provides a method for treating or preventing a transport disorder 

associated with decreased expression or activity of TRANP, the method comprising 
administering to a subject in need of such treatment an effective amount of the polypeptide 
selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or SEQ ID NO:9, 

30 and fragments thereof. 

The invention also provides a method for detecting a polynucleotide encoding a 
polypeptide selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID 
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N0:3, SEQ ID NO:4 ? SEQ ID NO:5, SEQ ID NO:6 ? SEQ ID NO:7, SEQ ID NO:8, or 
SEQ ID NO:9, and fragments thereof, in a biological sample containing nucleic acids, the 
method comprising the steps of (a) hybridizing the complement of the polynucleotide 
sequence encoding the polypeptide comprising SEQ ID NO:l, SEQ ID NO:2, SEQ ID 
5 NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or 
SEQ ID NO:9, or fragments thereof to at least one of the nucleic acids of the biological 
sample, thereby forming a hybridization complex; and (b) detecting the hybridization 
complex, wherein the presence of the hybridization complex correlates with the presence 
of a polynucleotide encoding TRANP in the biological sample. In one aspect, the nucleic 
10 acids of the biological sample are amplified by the polymerase chain reaction prior to the 
hybridizing step. 



DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is 
15 understood that this invention is not limited to the particular methodology, protocols, cell 
lines, vectors, and reagents described, as these may vary. It is also to be understood that 
the terminology used herein is for the purpose of describing particular embodiments only, 
and is not intended to limit the scope of the present invention which will be limited only 
by the appended claims. 

20 It must be noted that as used herein and in the appended claims, the singular forms 

"a," "an," and "the" include plural reference unless the context clearly dictates otherwise. 
Thus, for example, a reference to "a host ceil" includes a plurality of such host cells, and a 
reference to "an antibody" is a reference to one or more antibodies and equivalents thereof 
known to those skilled in the art, and so forth. 

25 Unless defined otherwise, all technical and scientific terms used herein have the 

same meanings as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Although any methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, the 
preferred methods, devices, and materials are now described. All publications mentioned 

30 herein are cited for the purpose of describing and disclosing the cell lines, vectors, and 
methodologies which are reported in the publications and which might be used in 
connection with the invention. Nothing herein is to be construed as an admission that the 
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invention is not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"TRANP," as used herein, refers to the amino acid sequences of substantially 
5 purified TRANP obtained from any species, particularly a mammalian species, including 
bovine, ovine, porcine, murine, equine, and preferably the human species, from any 
source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist," as used herein, refers to a molecule which, when bound to 
TRANP, increases or prolongs the duration of the effect of TRANP. Agonists may 

10 include proteins, nucleic acids, carbohydrates, or any other molecules which bind to and 
modulate the effect of TRANP. 

An "allele" or an "allelic sequence," as these terms are used herein, is an 
alternative form of the gene encoding TRANP. Alleles may result from at least one 
mutation in the nucleic acid sequence and may result in altered mRNAs or in polypeptides 

15 whose structure or function may or may not be altered. Any given natural or recombinant 
gene may have none, one, or many allelic forms. Common mutational changes which give 
rise to alleles are generally ascribed to natural deletions, additions, or substitutions of 
nucleotides. Each of these types of changes may occur alone, or in combination with the 
others, one or more times in a given sequence. 

20 "Altered" nucleic acid sequences encoding TRANP, as described herein, include 

those sequences with deletions, insertions, or substitutions of different nucleotides, 
resulting in a polynucleotide the same TRANP or a polypeptide with at least one 
functional characteristic of TRANP. Included within this definition are polymorphisms 
which may or may not be readily detectable using a particular oligonucleotide probe of the 

25 polynucleotide encoding TRANP, and improper or unexpected hybridization to alleles, 
with a locus other than the normal chromosomal locus for the polynucleotide sequence 
encoding TRANP. The encoded protein may also be "altered," and may contain deletions, 
insertions, or substitutions of amino acid residues which produce a silent change and result 
in a functionally equivalent TRANP. Deliberate amino acid substitutions may be made on 

30 the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilknty, and/or 
the amphipathic nature of the residues, as long as the biological or immunological activity 
of TRANP is retained. For example, negatively charged amino acids may include aspartic 
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acid and glutamic acid, positively charged amino acids may include lysine and arginine, 
and amino acids with uncharged polar head groups having similar hydrophilicity values 
may include leucine, isoleucine, and valine; glycine and alanine; asparagine and 
glutamine; serine and threonine; and phenylalanine and tyrosine. 
5 The terms "amino acid" or "amino acid sequence," as used herein, refer to an 

oligopeptide, peptide, polypeptide, or protein sequence, or a fragment of any of these, and 
to naturally occurring or synthetic molecules. In this context, "fragments", "immunogenic 
fragments", or "antigenic fragments" refer to fragments of TRANP which are preferably 
about 5 to about 15 amino acids in length and which retain some biological activity or 
1 0 immunological activity of TRANP. Where "amino acid sequence" is recited herein to 
refer to an amino acid sequence of a naturally occurring protein molecule, "amino acid 
sequence" and like terms are not meant to limit the amino acid sequence to the complete 
native amino acid sequence associated with the recited protein molecule. 

"Amplification," as used herein, relates to the production of additional copies of a 
1 5 nucleic acid sequence. Amplification is generally carried out using polymerase chain 
reaction (PCR) technologies well known in the art. (See, e.g., Dieffenbach, C.W. and 
G.S. Dveksler (1995) PCR Primer, a Laboratory Cold Spring Harbor Press, 

Plainview, NY, pp. 1-5.) 

The term "antagonist," as it is used herein, refers to a molecule which, when bound 
20 to TRANP, decreases the amount or the duration of the effect of the biological or 
immunological activity of TRANP. Antagonists may include proteins, nucleic acids, 
carbohydrates, antibodies, or any other molecules which decrease the effect of TRANP. 

As used herein, the term "antibody" refers to intact molecules as well as to 
fragments thereof, such as Fa, F(ab') 2 , and Fv fragments, which are capable of binding the 
25 epitopic determinant. Antibodies that bind TRANP polypeptides can be prepared using 
intact polypeptides or using fragments containing small peptides of interest as the 
immunizing antigen. The polypeptide or oligopeptide used to immunize an animal (e.g., a 
mouse, a rat, or a rabbit) can be derived from the translation of RNA, or synthesized 
chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers 
30 that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and 
keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the 
animal. 
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The term "antigenic determinant," as used herein, refers to that fragment of a 
molecule (i.e., an epitope) that makes contact with a particular antibody. When a protein 
or a fragment of a protein is used to immunize a host animal, numerous regions of the 
protein may induce the production of antibodies which bind specifically to antigenic 
5 determinants (given regions or three-dimensional structures on the protein). An antigenic 
determinant may compete with the intact antigen (i.e., the immunogen used to elicit the 
immune response) for binding to an antibody. 

The term "antisense," as used herein, refers to any composition containing a 
nucleic acid sequence which is complementary to a specific nucleic acid sequence. The 
10 term "antisense strand" is used in reference to a nucleic acid strand that is complementary 
to the "sense" strand. Antisense molecules may be produced by any method including 
synthesis or transcription. Once introduced into a cell, the complementary nucleotides 
combine with natural sequences produced by the cell to form duplexes and to block either 
transcription or translation. The designation "negative" can refer to the antisense strand, 
15 and the designation "positive" can refer to the sense strand. 

As used herein, the term "biologically active," refers to a protein having structural, 
regulatory, or biochemical functions of a naturally occurring molecule. Likewise, 
"immunologically active" refers to the capability of the natural, recombinant, or synthetic 
TRANP, or of any oligopeptide thereof, to induce a specific immune response in 
20 appropriate animals or cells and to bind with specific antibodies. 

The terms "complementary" or "complementarity," as used herein, refer to the 
natural binding of polynucleotides under permissive salt and temperature conditions by 
base pairing. For example, the sequence "A-G-T" binds to the complementary sequence 
"T-C-A." Complementarity between two single-stranded molecules may be "partial," 
25 such that only some of the nucleic acids bind, or it may be "complete," such that total 
complementarity exists between the single stranded molecules. The degree of 
complementarity between nucleic acid strands has significant effects on the efficiency and 
strength of the hybridization between the nucleic acid strands. This is of particular 
importance in amplification reactions, which depend upon binding between nucleic acids 
30 strands, and in the design and use of peptide nucleic acid (PNA) molecules. 

A "composition comprising a given polynucleotide sequence" or a "composition 
comprising a given amino acid sequence," as these terms are used herein, refer broadly to 
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any composition containing the given polynucleotide or amino acid sequence. The 
composition may comprise a dry formulation, an aqueous solution, or a sterile 
composition. Compositions comprising polynucleotide sequences encoding TRANP or 
fragments of TRANP may be employed as hybridization probes. The probes may be stored 
5 in freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. 
In hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., 
NaCl), detergents (e.g., SDS), and other components (e.g., Denhardt's solution, dry milk, 
salmon sperm DNA, etc.). 

The phrase "consensus sequence," as used herein, refers to a nucleic acid sequence 
10 which has been resequenced to resolve uncalled bases, extended using XL-PCR™ (Perkin 
Elmer, Norwalk, CT) in the 5' and/or the 3' direction, and resequenced, or which has been 
assembled from the overlapping sequences of more than one Incyte Clone using a 
computer program for fragment assembly, such as the GEL VIEW™ Fragment Assembly 
system (GCG, Madison, WI). Some sequences have been both extended and assembled to 
1 5 produce the consensus sequence . 

As used herein, the term "correlates with expression of a polynucleotide" indicates 
that the detection of the presence of nucleic acids, the same or related to a nucleic acid 
sequence encoding TRANP, by northern analysis is indicative of the presence of nucieic 
acids encoding TRANP in a sample, and thereby correlates with expression of the 
20 transcript from the polynucleotide encoding TRANP. 

A "deletion," as the term is used herein, refers to a change in the amino acid or 
nucleotide sequence that results in the absence of one or more amino acid residues or 
nucleotides. 

The term "derivative," as used herein, refers to the chemical modification of 
25 TRANP, of a polynucleotide sequence encoding TRANP, or of a polynucleotide sequence 
complementary to a polynucleotide sequence encoding TRANP. Chemical modifications 
of a polynucleotide sequence can include, for example, replacement of hydrogen by an 
alkyl, acyl, or amino group. A derivative polynucleotide encodes a polypeptide which 
retains at least one biological or immunological function of the natural molecule. A 
30 derivative polypeptide is one modified by glycosylation, pegylation, or any similar process 
that retains a at least one biological or immunological function of the polypeptide from 
which it was derived. 
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The term "homology," as used herein, refers to a degree of complementarity. 
There may be partial homology or complete homology. The word "identity" may 
substitute for the word "homology." A partially complementary sequence that at least 
partially inhibits an identical sequence from hybridizing to a target nucleic acid is referred 
5 to as "substantially homologous." The inhibition of hybridization of the completely 
complementary sequence to the target sequence may be examined using a hybridization 
assay (Southern or northern blot, solution hybridization, and the like) under conditions of 
reduced stringency. A substantially homologous sequence or hybridization probe will 
compete for and inhibit the binding of a completely homologous sequence to the target 

10 sequence under conditions of reduced stringency. This is not to say that conditions of 
reduced stringency are such that non-specific binding is permitted, as reduced stringency 
conditions require that the binding of two sequences to one another be a specific (i.e., a 
selective) interaction. The absence of non-specific binding may be tested by the use of a 
second target sequence which lacks even a partial degree of complementarity (e.g., less 

15 than about 30% homology or identity). In the absence of non-specific binding, the 
substantially homologous sequence or probe will not hybridize to the second non- 
complementary target sequence. 

The phrases "percent identity" or "% identity" refer to the percentage of sequence 
similarity found in a comparison of two or more amino acid or nucleic acid sequences. 

20 Percent identity can be determined electronically, e.g., by using the MegAlign program 
(DNASTAR, Inc., Madison WI). The MegAlign program can create alignments between 
two or more sequences according to different methods, e.g., the clustal method. (Higgins, 
D.G. and P. M. Sharp (1988) Gene 73:237-244.) The clustal algorithm groups sequences 
into clusters by examining the distances between all pairs. The clusters are aligned 

25 pairwise and then in groups. The percentage similarity between two amino acid 
sequences, e.g., sequence A and sequence B, is calculated by dividing the length of 
sequence A, minus the number of gap residues in sequence A, minus the number of gap 
residues in sequence B, into the sum of the residue matches between sequence A and 
sequence B, times one hundred. Gaps of low or of no homology between the two amino 

30 acid sequences are not included in determining percentage similarity. Percent identity 
between nucleic acid sequences can also be calculated or counted by the clustal method, or 
by other methods known in the art, such as the Jotun Hein Method. (See, e.g., Hein, J. 
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(1 990) Methods in Enzymology 1 83:626-645.) Identity between sequences can also be 
determined by other methods known in the art, e.g., by varying hybridization conditions. 

"Human artificial chromosomes" (HACs), as described herein, are linear 
microchromosomes which may contain DNA sequences of about 6 kb to 10 Mb in size, 
5 and which contain all of the elements required for stable mitotic chromosome segregation 
and maintenance. (See, e.g., Harrington, J.J. et al. (1997) Nat Genet 15:345-355.) 

The term "humanized antibody," as used herein, refers to antibody molecules in 
which the amino acid sequence in the non-antigen binding regions has been altered so that 
the antibody more closely resembles a human antibody, and still retains its original 
10 binding ability. 

"Hybridization," as the term is used herein, refers to any process by which a strand 
of nucleic acid binds with a complementary strand through base pairing. 

As used herein, the term "hybridization complex" as used herein, refers to a 
complex formed between two nucleic acid sequences by virtue of the formation of 
15 hydrogen bonds between complementary bases. A hybridization complex may be formed 
in solution (e.g., C 0 t or R^t analysis) or formed between one nucleic acid sequence present 
in solution and another nucleic acid sequence immobilized on a solid support (e.g., paper, 
membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which 
cells or their nucleic acids have been fixed). 
20 The words "insertion" or "addition," as used herein, refer to changes in an amino 

acid or nucleotide sequence resulting in the addition of one or more amino acid residues or 
nucleotides, respectively, to the sequence found in the naturally occurring molecule. 

"Immune response" can refer to conditions associated with inflammation, trauma, 
immune disorders, or infectious or genetic disease, etc. These conditions can be 
25 characterized by expression of various factors, e.g., cytokines, chemokines, and other 
signaling molecules, which may affect cellular and systemic defense systems. 

The term "microarray," as used herein, refers to an array of distinct polynucleotides 
or oligonucleotides arrayed on a substrate, such as paper, nylon or any other type of 
membrane, filter, chip, glass slide, or any other suitable solid support. 
30 The term "modulate," as it appears herein, refers to a change in the activity of 

TRANP. For example, modulation may cause an increase or a decrease in protein activity, 
binding characteristics, or any other biological, functional, or immunological properties of 
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TRANP. 

The phrases "nucleic acid" or "nucleic acid sequence," as used herein, refer to an 
oligonucleotide, nucleotide, polynucleotide, or any fragment thereof, to DNA or RNA of 
genomic or synthetic origin which may be single-stranded or double-stranded and may 
5 represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA- 
like or RNA-like material. In this context, "fragments" refers to those nucleic acid 
sequences which are greater than about 60 nucleotides in length, and most preferably are at 
least about 100 nucleotides, at least about 1000 nucleotides, or at least about 10,000 
nucleotides in length. 

10 The terms "operably associated" or "operably linked," as used herein, refer to 

functionally related nucleic acid sequences. A promoter is operably associated or operably 
linked with a coding sequence if the promoter controls the transcription of the -encoded 
polypeptide. While operably associated or operably linked nucleic acid sequences can be 
contiguous and in reading frame, certain genetic elements, e.g., repressor genes, are not 

15 contiguously linked to the encoded polypeptide but still bind to operator sequences that 
control expression of the polypeptide. 

The term "oligonucleotide," as used herein, refers to a nucleic acid sequence of at 
least about 6 nucleotides to 60 nucleotides, preferably about 15 to 30 nucleotides, and 
most preferably about 20 to 25 nucleotides, which can be used in PGR amplification or in 

20 a hybridization assay or microarray. As used herein, the term "oligonucleotide" is 

substantially equivalent to the terms "amplimers," "primers," "oligomers," and "probes," 
as these terms are commonly defined in the art. 

"Peptide nucleic acid" (PNA), as used herein, refers to an antisense molecule or 
anti-gene agent which comprises an oligonucleotide of at least about 5 nucleotides in 

25 length linked to a peptide backbone of amino acid residues ending in lysine. The terminal 
lysine confers solubility to the composition. PNAs preferentially bind complementary 
single stranded DNA and RNA and stop transcript elongation, and may be pegylated to 
extend their lifespan in the cell. (See, e.g., Nielsen, P.E. et aL (1993) Anticancer Drug 
Des. 8:53-63.) 

30 The term "sample," as used herein, is used in its broadest sense. A biological 

sample suspected of containing nucleic acids encoding TRANP, or fragments thereof, or 
TRANP itself may comprise a bodily fluid; an extract from a cell, chromosome, organelle, 
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or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA, in solution or 
bound to a solid support; a tissue; a tissue print; etc. 

As used herein, the terms "specific binding" or "specifically binding" refer to that 
interaction between a protein or peptide and an agonist, an antibody, or an antagonist The 
5 interaction is dependent upon the presence of a particular structure of the protein 
recognized by the binding molecule (i.e., the antigenic determinant or epitope). For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide 
containing the epitope A, or the presence of free unlabeled A, in a reaction containing free 
labeled A and the antibody will reduce the amount of labeled A that binds to the antibody. 
10 As used herein, the term "stringent conditions" refers to conditions which permit 

hybridization between polynucleotide sequences and the claimed polynucleotide 
sequences. Suitably stringent conditions can be defined by, for example, the 
concentrations of salt or formamide in the prehybridization and hybridization solutions, or 
by the hybridization temperature, and are well known in the art. In particular, stringency 
15 can be increased by reducing the concentration of salt, increasing the concentration of 
formamide, or raising the hybridization temperature. 

For example, hybridization under high stringency conditions could occur in about 
50% formamide at about 37°C to 42°C. Hybridization could occur under reduced 
stringency conditions in about 35% to 25% formamide at about 30°C to 35°G. In 
20 particular, hybridization could occur under high stringency conditions at 42°C in 50% 
formamide, 5X SSPE, 0.3% SDS, and 200 Mg/ml sheared and denatured salmon sperm 
DNA. Hybridization could occur under reduced stringency conditions as described above, 
but in 35% formamide at a reduced temperature of 35°C. The temperature range 
corresponding to a particular level of stringency can be further narrowed by calculating the 
25 purine to pyrimidine ratio of the nucleic acid of interest and adjusting the temperature 
accordingly. Variations on the above ranges and conditions are well known in the art. 

The term "substantially purified," as used herein, refers to nucleic acid or amino 
acid sequences that are removed from their natural environment and are isolated or 
separated, and are at least about 60% free, preferably about 75% free, and most preferably 
30 about 90% free from other components with which they are naturally associated. 

A "substitution," as used herein, refers to the replacement of one or more amino 
acids or nucleotides by different amino acids or nucleotides, respectively. 
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"Transformation," as defined herein, describes a process by which exogenous DNA 
enters and changes a recipient cell. Transformation may occur under natural or artificial 
conditions according to various methods well known in the art, and may rely on any 
known method for the insertion of foreign nucleic acid sequences into a prokaryotic or 
5 eukaryotic host cell. The method for transformation is selected based on the type of host 
cell being transformed and may include, but is not limited to, viral infection, 
electroporation, heat shock, lipofection, and particle bombardment. The term 
"transformed" cells includes stably transformed cells in which the inserted DNA is capable 
of replication either as an autonomously replicating plasmid or as part of the host 

10 chromosome, and refers to cells which transiently express the inserted DNA or RNA for 
limited periods of time. 

A "variant" of TRANP, as used herein, refers to an amino acid sequence that is 
altered by one or more amino acids. The variant may have "conservative" changes, 
wherein a substituted amino acid has similar structural or chemical properties <e.g., 

15 replacement of leucine with isoleucine). More rarely, a variant may have 

"nonconservative" changes (e.g., replacement of glycine with tryptophan). Analogous 
minor variations may also include amino acid deletions or insertions, or both. Guidance in 
determining which amino acid residues may be substituted, inserted, or deleted without 
abolishing biological or immunological activity may be found using computer programs 

20 well known in the art, for example, DNASTAR software. 

THE INVENTION 

The invention is based on the discovery of new human transport-associated 
molecules (TRANP), the polynucleotides encoding TRANP, and the use of these 
25 compositions for the diagnosis, treatment, or prevention of cancer and transport disorders. 
Table 1 shows the sequence identification numbers, Incyte Clone identification number, 
and cDNA library for each of the human transport-associated molecules disclosed herein. 

Table 1 

30 



PROTEIN 


NUCLEOTIDE 


CLONE ID 


LIBRARY 


SEQ ID NO:l 


SEQ ID NO: 10 


144861 


TLYMNOR01 
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SEQ ID NO:2 


r— — — 

SEQ ID NO: 11 


607812 


COLNNOT01 


SEQ ID NO:3 


SEQ ID NO: 12 


1259384 


MENITUT03 


SEQ ID NO:4 


SEQ ID NO: 13 


1340813 


COLNTUT03 


SEQ ID NO:5 


SEO ID NO- 14 


I oo5/73 1 


PROSTUT10 


SEQ ID NO:6 


SEQ ID NO: 15 


2751730 


THPIAZS08 


SEQ ID NO:7 


SEQ ID NO: 16 


2794975 


NPOLNOT01 


SEQ ID NO:8 


SEQ ID NO: 17 ~ 


2797710 ~~~ 


NPOLNOT01 


SEQ ID NO:9 


SEQ ID NO: 18 


2914719 


THYMFET03 



10 



15 



Nucleic acids encoding the TRANP-1 of the present invention were first identified 
n» Incyte Clone 144861 from the mononuclear cell cDNA library (TLYMNOR01) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 10, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 144861 (TLYMNOR01) and 156223 (THP1PLB02). 

In one embodiment, the invention encompasses a polypeptide comprising the 
amino acid sequence of SEQ ID NO.l. TRANP-1 is 383 amino acids in length and has 
two potential sugar transport protein signatures from residues L 42 to R„ and from residues 
A 230 to S 246 . TRANP-1 has chemical and structural homology with a sugar transporter 
from sugar beet (GI 1209756). In particular, TRANP and GI 1209756; share 3 0 o/ 0 identity 
20 The fragment of SEQ ID NO:10 from about nucleotide 442 to nucleotide 483 is useful for 
hybridization. Northern analysis shows the expression of this sequence in 
hematopoietic/immune, cardiovascular, and urologic cDNA libraries. Approximately 60% 
of these libraries are associated with fetal tissues and proliferating cell lines and 40% with 
inflammation. 

Nucleic acids encoding the TRANP-2 of the present invention were first identified 
»n Incyte Clone 607812 from the colon tissue cDNA library (COLNNOT01) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO:l 1, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 607812 (COLNNOT01), 3 1 761 1 7 (UTRSTUT04), and the 
30 shotgun sequences SAEA02352 and SAEC 1 0448. 

In one embodiment, the invention encompasses a polypeptide comprising the 
amino acid sequence of SEQ ID NO:2. TRANP-2 is 272 amino acids in length and has a 
potential N-glycosolation site at residue N 96 , a potential casein kinase II phosphorylation 
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site at residue T 48 , four potential protein kinase C phosphorylation sites at residues T, 5g , 
Si63> T 25i> and S 265 , and a leucine zipper pattern from residue L 62 to residue L 83 . TRANP-2 
has chemical and structural homology with the C elegans ammonium transporter (GI 
1 125753). In particular, TRANP-2 and GI 1 125753 share 47% identity. The fragment of 
5 SEQ ID NO:l 1 from about nucleotide 251 to nucleotide 274 is useful for hybridization. 
Northern analysis shows the expression of this sequence in reproductive, cardiovascular, 
gastrointestinal, neural, and urologic cDNA libraries. Approximately 67% of these 
libraries are associated with cancer, and 28% with fetal tissues and proliferating cell lines. 
Nucleic acids encoding the TRANP-3 of the present invention were first identified 

10 in Incyte Clone 1259384 from the meningioma cDNA library (MENITUT03) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 12, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 1259384 (MENITUT03)and 2946062 (BRAITUT23). 

In one embodiment, the invention encompasses a polypeptide comprising the 

15 amino acid sequence of SEQ ID NO:3. TRANP-3 is 21 0 amino acids in length and has a 
potential amidation site at residue D 55 , a potential N-glycosoIation site at residue N g3 , five 
potential casein kinase II phosphorylation sites at residues Stq, S^, T 88 , S, i5 , and S l99 , four 
potential protein kinase C phosphorylation sites at residues T 45 , S TO , T 85 , and S,^, and two 
potential tyrosine phosphorylation sites at residues and Y,,,. TRANP-3 has chemical 

20 and structural homology with the zeta-COP coatomer protein from cow (GI 441486). In 
particular, TRANP-3 and GI 441486 share 63% identity. The fragment of SEQ ID NO:12 
from about nucleotide 63 to nucleotide 125 is useful for hybridization. Northern analysis 
shows the expression of this sequence in reproductive, neural, cardiovascular, 
gastrointestinal, developmental, endocrine, musculoskeletal, and hematopoietic/immune 

25 cDNA libraries. Approximately 59% of these libraries are associated with cancer, and 
1 8% with fetal tissues and proliferating cell lines. 

Nucleic acids encoding the TRANP-4 of the present invention were first identified 
in Incyte Clone 1340813 from the colon tumor cDNA library (COLNTUT03) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 

30 NO: 13, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 1 3408 1 3 and 1 343253 (COLNTUT03), 1 5 1 5390 
(PANCTUT01), 7581 10 (BRAITUT02), 1573508 (LNODNOT03), 1855515 ( 
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HNT3A2T01), and 1971226 (UCMCL5T01). 

In one embodiment, the invention encompasses a polypeptide comprising the 
amino acid sequence of SEQ ID NO:4. TRANP-4 is 465 amino acids in length and has 
chemical and structural homology with the rat monocarboxylate transporter (GI 2463651). 
5 In particular, TRANP-4 and GI 246365 1 share 90% identity. The fragment of SEQ ID 
NO: 13 from about nucleotide 674 to nucleotide 694 is useful for hybridization. Northern 
analysis shows the expression of this sequence in hematopoietic/immune, gastrointestinal, 
reproductive, cardiovascular, neural, musculoskeletal, developmental, urologic, and 
dermatologic cDNA libraries. Approximately 41% of these libraries are associated with 
1 o cancer, 32% with inflammation, and 23% with fetal tissues and proliferating cell lines. 

Nucleic acids encoding the TRANP-5 of the present invention were first identified 
in Incyte Clone 1689731 from the prostate tumor cDNA library (PROSTUT10) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 14, was derived from the following overlapping and/or extended nucleic acid 
1 5 sequences: Incyte Clones 1 68973 1 (PROSTUT1 0), 1 572293 (UTRSNOT05), 1 807228 
(SINTNOT13), 1556505 (BLADTUT04), 1295734 (PCANNOT03) 1721666 
(BLADNOT06), and the shotgun sequences SAEA02030, SAEA00859, and SAEA02913. 

In one embodiment, the invention encompasses a polypeptide comprising the 
amino acid sequence of SEQ ID NO:5. TRANP-5 is 237 amino acids in length and has a 
20 potential ATP/GTP-binding site motif from residues G l6 to S 23 , and a potential prenylation 
site from residues C 234 to P 2J7 . TRANP-5 has chemical and structural homology with 
mouse rab23 (GI 438162). In particular, TRANP-5 and GI 438162 share 93% identity. 
The fragment of SEQ ID NO:14 from about nucleotide 721 to nucleotide 759 is useful for 
hybridization. Northern analysis shows the expression of this sequence in reproductive, 
25 cardiovascular, neural, gastrointestinal, urologic, developmental, endocrine, and 

dermatological cDNA libraries. Approximately 53% of these libraries are associated with 
cancer, 13% with inflammation, and 13% with fetal tissues and proliferating cell lines. 

Nucleic acids encoding the TRANP-6 of the present invention were first identified 
in Incyte Clone 2751730 from the mononuclear cell cDNA library (THP1AZS08) using a 
30 computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 15, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 2751730 and 2754090 (THP1AZS08), 2174086 (ENDCNOT03), 
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and 342996 (BRSTNOR01). 

In one embodiment, the invention encompasses a polypeptide comprising the 
amino acid sequence of SEQ ID NO:6. TRANP-6 is 208 amino acids in length and has a 
potential ATP/GTP -binding site motif from residues G 20 to T 27 . TRANP-6 has chemical 
5 and structural homology with human GTP binding protein (GI 550072). In particular, 
TRANP-6 and GI 550072 share 90% identity. The fragment of SEQ ID NO: 15 from about 
nucleotide 551 to nucleotide 577 is useful for hybridization. Northern analysis shows the 
expression of this sequence in neural, reproductive, hematopoietic/immune, 
developmental, and dermatologic cDNA libraries. Approximately 43% of these libraries 

10 are associated with fetal tissues and proliferating cell lines, and 28% with cancer. 

Nucleic acids encoding the TRANP-7 of the present invention were first identified 
in Incyte Clone 2794975 from the nasal polyp tissue cDNA library (NPOLNOT01) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 16, was derived from the following overlapping and/or extended nucleic acid 

15 sequences: Incyte Clones 2794975 (NPOLNOT01), 3722893 (BRSTNOT23), 2922316 
(SININOT04), 2624003 (KERANOT02), 1394517 (THYRNOT03), 1416892 
(BRAINOT12), 1418058 (KIDNNOT09), 008439 (HMC1NOT01), 1873165 
(LEUKNOT02), and 1370761 (BSTMNON02). 

In one embodiment, the invention encompasses a polypeptide comprising the 

20 amino acid sequence of SEQ ID NO:7. TRANP-7 is 709 amino acids in length and has 
two potential ABC transporter family signatures from residues F 328 to L J39 and from 
residues L 6II toT 625 , and two potential ATP/GTP- binding site motifs from residues G 2I0 
toT 2l7 and from residues G 525 to S 532 . TRANP-7 has chemical and structural homology 
with a £L elegans putative ATP-binding transport protein (GI 500734). In particular, 

25 TRANP-7 and GI 500734 share 47% identity. The fragment of SEQ ID NO: 1 6 from about 
nucleotide 192 to nucleotide 227 is useful for hybridization. Northern analysis shows the 
expression of this sequence in neural, reproductive, hematopoietic/immune, 
gastrointestinal, and cardiovascular cDNA libraries. Approximately 38% of these libraries 
are associated with cancer, 24% with inflammation, and 22% with fetal tissues and 

30 proliferating cell lines. 

Nucleic acids encoding the TRANP-8 of the present invention were first identified 
in Incyte Clone 2797710 from the nasal polyp tissue cDNA library (NPOLNOTOl) using a 
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computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 17, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 2797710 (NPOLNOT01), 879491 (THYRNOT02), 234596 
(SINTNOT02), 251 188 (PANCDIT01), 036294 (HUVENOB01), 2939073 ' 
5 (THYMFET02), 2019482 (CONNNOT01), 2171995 (ENDCNOT03), 517825 
(MMLR1DT01), 1714811 (UCMCNOT02), 371259 (LUNGNOT02), 1995866 
(BRSTTUT03), 2642178 (LUNGTUT08), and the shotgun sequences SADA01064 and 
SADA00379. 

In one embodiment, the invention encompasses a polypeptide comprising the 
10 amino acid sequence of SEQ ID NO:8. TRANP-8 is 962 amino acids in length and has 
five potential N-glycosylation sites at residues N 10 „ N 123 , N 243 , N 4Jt , and N 882 , one potential 
tyrosine kinase phosphorylation site at residue Y 8 „ eighteen potential casein kinase II 
Phosphorylation sites at residues T 18 , T M , T 74 , S,„ S 129 , T, 36 , T 410 , T 4J3 , S 58J , S 6J1 , S 632 , S 717 , 
T 7S4 , S 7J8 , S 780 , T M4 , T 890 , and S,o 2 . TRANP-8 has chemical and structural homology with 
15 theratpll5vesicuiartransportfactor(GI538153). In particular, TRANP-8 and 

GI 538153 share 93% identity. The fragment of SEQ ID NO:17 from about nucleotide 
2688 to nucleotide 2723 is useful for hybridization. Northern analysis shows the 
expression of this sequence in reproductive, neural, gastrointestinal, 
hematopoietic/immune, and cardiovascular cDNA libraries. Approximately 45% of these 
20 libraries are associated with cancer, 30% with inflammation, and 1 7% with fetal tissues 
and proliferating cell lines. 

Nucleic acids encoding the TRANP-9 of the present invention were first identified 
in Incyte Clone 2914719 from the fetal thymus cDNA library (THYMFET03) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
25 NO: 1 8, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 2914719 (THYMFET03), 264268 (HNT2AGT01), 1553167 
(BLADTUT04), 550095 (BEPINOT01), 2148915 (BRAINOT09), and 588157 
(UTRSNOT01). 

In one embodiment, the invention encompasses a polypeptide comprising the 
30 amino acid sequence of SEQ ID NO:9. TRANP-9 is 368 amino acids in length and has 
two potential beta transducin family Trp-Asp (WD-40) repeats from residues V I01 to L, I5 
and from residues V l44 to T I58 . TRANP-9 has chemical and structural homology with £. 
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eiegans poIyA RNA export protein (GI 1546734). In particular, TRANP-9 and 
GI 1546734 share 47% identity. The fragment ofSEQ ID NO: 18 from about nucleotide 
203 to nucleotide 265 is usefiil for hybridization. Northern analysis shows the expression 
of this sequence in reproductive, neural, hematopoietic/immune, and gastrointestinal 
5 cDNA libraries. Approximately 39% of these libraries are associated with cancer, 33% 
with inflammation, and 23% with fetal tissues and proliferating cell lines. 

The invention also encompasses TRANP variants. A preferred TRANP variant is 
one which has at least about 80%, more preferably at least about 90%, and most preferably 
at least about 95% amino acid sequence identity to the TRANP amino acid sequence, and 
10 which contains at least one functional or structural characteristic of TRANP. 

The invention also encompasses polynucleotides which encode TRANP. In a 
particular embodiment, the invention encompasses a polynucleotide consisting of a nucleic 
acid sequence selected from group consisting of SEQ ID NO: 10, SEQ ID NO: 1 1, SEQ ID 
NO: 1 2, SEQ ID NO: 1 3, SEQ ID NO: 14, SEQ ID NO: 1 5, SEQ ID NO: 1 6, SEQ ID NO: 1 7, 
15 and SEQ ID NO: 18. 

The invention also encompasses a variant of a polynucleotide sequence encoding 
TRANP. In particular, such a variant polynucleotide sequence will have at least about 
80%, more preferably at least about 90%, and most preferably at least about 95% 
polynucleotide sequence identity to the polynucleotide sequence encoding TRANP. A 
20 particular aspect of the invention encompasses a variant of a polynucleotide sequence 
selected from the group consisting of SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, 
SEQ ID NO: 1 3, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, and 
SEQ ID NO:l 8, which has at least about 80%, more preferably at least about 90%, and 
most preferably at least about 95% polynucleotide sequence identity to the polynucleotide 
25 sequence encoding TRANP. Any one of the polynucleotide variants described above can 
encode an amino acid sequence which contains at least one functional or structural 
characteristic of TRANP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of 
the genetic code, a multitude of polynucleotide sequences encoding TRANP, some bearing 
30 minimal homology to the polynucleotide sequences of any known and naturally occurring 
gene, may be produced. Thus, the invention contemplates «ach and every possible 
variation of polynucleotide sequence that could be made by selecting combinations based 
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on possible codon choices. These combinations are made in accordance with the standard 
triplet genetic code as applied to the polynucleotide sequence of naturally occurring 
TRANP, and all such variations are to be considered as being specifically disclosed. 
Although nucleotide sequences which encode TRANP and its variants are 
5 preferably capable of hybridizing to the nucleotide sequence of the naturally occurring 
TRANP under appropriately selected conditions of stringency, it may be advantageous to 
produce nucleotide sequences encoding TRANP or its derivatives possessing a 
substantially different codon usage. Codons may be selected to increase the rate at which 
expression of the peptide occurs in a particular prokaryotic or eukaryotic host in 
10 accordance with the frequency with which particular codons are utilized by the host. 

Other reasons for substantially altering the nucleotide sequence encoding TRANP and its 
derivatives without altering the encoded amino acid sequences include the production of 
RNA transcripts having more desirable properties, such as a greater half-life, than 
transcripts produced from the naturally occurring sequence. 

15 The invention also encompasses production of DNA sequences which encode 

TRANP and TRANP derivatives, or fragments thereof, entirely by synthetic chemistry. 
After production, the synthetic sequence may be inserted into any of the many available 
expression vectors and cell systems using reagents that are well known in the art. 
Moreover, synthetic chemistry may be used to introduce mutations into a sequence 

20 encoding TRANP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable 
of hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown 
in SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ 
ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, or SEQ ID NO: 18, and fragments thereof, 

25 under various conditions of stringency. (See, e.g., Wahl, G.M. and S.L. Berger (1 987) 
Methods Enzymol. 152:399-407; and Kimmel, A.R. (1987) Methods Enzymol. 152:507- 
511.) 

Methods for DNA sequencing are well known and generally available in the art 
and may be used to practice any of the embodiments of the invention. The methods may 
30 employ such enzymes as the Klenow fragment of DNA polymerase I, Sequenase® (US 
Biochemical Corp., Cleveland, OH), Taq polymerase (Perkin Elmer), thermostable T7 
polymerase (Amersham, Chicago, IL), or combinations of polymerases and proofreading 
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exonucleases such as those found in the ELONGASE Amplification System (GlBCO/BRL, 
Gaithersburg, MD). Preferably, the process is automated with machines such as the 
Hamilton Micro Lab 2200 (Hamilton, Reno, NV), Peltier Thermal Cycler (PTC200; MJ 
Research, Watertown, MA) and the ABI Catalyst and 373 and 377 DNA Sequencers 
5 (Perkin Elmer). 

The nucleic acid sequences encoding TRANP may be extended utilizing a partial 
nucleotide sequence and employing various methods known in the art to detect upstream 
sequences, such as promoters and regulatory elements. For example, one method which 
may be employed, restriction-site PCR, uses universal primers to retrieve unknown 

10 sequence adjacent to a known locus. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 
2:3 1 8-322.) In particular, genomic DNA is first amplified in the presence of a primer 
complementary to a linker sequence within the vector and a primer specific to the region 
predicted to encode the gene. The amplified sequences are then subjected to a second 
round of PCR with the same linker primer and another specific primer internal to the first 

15 one. Products of each round of PCR are transcribed with an appropriate RNA polymerase 
and sequenced using reverse transcriptase. 

Inverse PCR may also be used to amplify or extend sequences using divergent 
primers based on a known region. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids Res. 
16:81 86.) The primers may be designed using commercially available software such as 

20 OLIGO 4.06 Primer Analysis software (National Biosciences Inc., Plymouth, MN) or 
another appropriate program to be about 22 to 30 nucleotides in length, to have aGC 
content of about 50% or more, and to anneal to the target sequence at temperatures of 
about 68°C to 72°C. The method uses several restriction enzymes to generate a suitable 
fragment in the known region of a gene. The fragment is then circularized by 

25 intramolecular ligation and used as a PCR template. 

Another method which may be used is capture PGR, which involves PCR 
amplification of DNA fragments adjacent to a known sequence in human and yeast 
artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. (1991) PCR Methods 
Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and ligations 

30 may be used to place an engineered double-stranded sequence into an unknown fragment 
of the DNA molecule before performing PCR. Other methods which may be used to 
retrieve unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) 
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Nucleic Acids Res. 19:3055-3060.) Additionally, one may use PCR, nested primers, and 
PromoterFinder™ libraries to walk genomic DNA (Clontech, Palo Alto, CA). This 
process avoids the need to screen libraries and is useful in finding intron/exon junctions. 
When screening for full-length cDNAs, it is preferable to use libraries that have 
5 been size-selected to include larger cDNAs. Also, random-primed libraries are preferable 
in that they will include more sequences which contain the 5' regions of genes. Use of a 
randomly primed library may be especially preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension 
of sequence into 5* non-transcribed regulatory regions. 
10 Capillary electrophoresis systems which are commercially available may be used 

to analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In 
particular, capillary sequencing may employ flowable polymers for electrophoretic 
separation, four different fluorescent dyes (one for each nucleotide) which are laser 
activated, and a charge coupled device camera for detection of the emitted wavelengths. 
15 Output/light intensity may be converted to electrical signal using appropriate software 
(e.g., Genotyper™ and Sequence Navigator™, Perkin Elmer), and the entire process from 
loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for the sequencing of small 
pieces of DNA which might be present in limited amounts in a particular sample. 
20 In another embodiment of the invention, polynucleotide sequences or fragments 

thereof which encode TRANP may be used in recombinant DNA molecules to direct 
expression of TRANP, or fragments or functional equivalents thereof, in appropriate host 
cells. Due to the inherent degeneracy of the genetic code, other DNA sequences which 
encode substantially the same or a functionally equivalent amino acid sequence may be 
25 produced, and these sequences may be used to clone and express TRANP. 

As will be understood by those of skill in the art, it may be advantageous to 
produce TRANP-encoding nucleotide sequences possessing non-naturally occurring 
codons. For example, codons preferred by a particular prokaryotic or eukaryotic host can 
be selected to increase the rate of protein expression or to produce an RNA transcript 
30 having desirable properties, such as a half-life which is longer than that of a transcript 
generated from the naturally occurring sequence. 

The nucleotide sequences of the present invention can be engineered using 
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methods generally known in the art in order to alter TRANP-encoding sequences for a 
variety of reasons including, but not limited to, alterations which modify the cloning, 
processing, and/or expression of the gene product. DNA shuffling by random 
fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may 
5 be used to engineer the nucleotide sequences. For example, site-directed mutagenesis may 
be used to insert new restriction sites, alter glycosylation patterns, change codon 
preference, produce splice variants, introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or recombinant nucleic 
acid sequences encoding TRANP may be ligated to a heterologous sequence to encode a 

10 fusion protein. For example, to screen peptide libraries for inhibitors of TRANP activity, 
it may be useful to encode a chimeric TRANP protein that can be recognized by a 
commercially available antibody. A fusion protein may also be engineered to contain a 
cleavage site located between the TRANP encoding sequence and the heterologous protein 
sequence, so that TRANP may be cleaved and purified away from the heterologous 

15 moiety. 

In another embodiment, sequences encoding TRANP may be synthesized, in whole 
or in part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. 
(1980) Nucl. Acids Res. Symp. Ser. 215-223, and Horn, T. et al. (1980) Nucl. Acids Res. 
Symp. Ser. 225-232.) Alternatively, the protein itself may be produced using chemical 

20 methods to synthesize the amino acid sequence of TRANP, or a fragment thereof. For 
example, peptide synthesis can be performed using various solid-phase techniques. <See, 
e.g., Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be 
achieved using the ABI 43 1 A Peptide Synthesizer (Perkin Elmer). 

The newly synthesized peptide may be substantially purified by preparative high 

25 performance liquid chromatography. (See, e.g, Chiez, R.M. and F.Z. Regnier (1990) 
Methods Enzymol. 1 82:392-421 .) The composition of the synthetic peptides may be 
confirmed by amino acid analysis or by sequencing. (See, e.g., Creighton, T. (1983) 
Proteins, Structures ar?d Mo)ecu|ar Properties, WH Freeman and Co., New York, NY.) 
Additionally, the amino acid sequence of TRANP, or any part thereof, may be altered 

30 during direct synthesis and/or combined with sequences from other proteins, or any part 
thereof, to produce a variant polypeptide. 

In order to express a biologically active TRANP, the nucleotide sequences 
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encoding TRANP or derivatives thereof may be inserted into appropriate expression 
vector, i.e., a vector which contains the necessary elements for the transcription and 
translation of the inserted coding sequence. 

Methods which are well known to those skilled in the art may be used to construct 
5 expression vectors containing sequences encoding TRANP and appropriate transcriptional 
and translational control elements. These methods include in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, 
J. et al. (1 989) Molecular Cloning. A Laboratory Manual , Cold Spring Harbor Press, 
Plainview, NY, ch. 4, 8, and 16-17; and Ausubel, F.M. et al. (1995, and periodic 
1 0 supplements) Current Protocols in Molecular Biology. John Wiley & Sons, New York, 
NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding TRANP. These include, but are not limited to, microorganisms such 
as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA 
15 expression vectors; yeast transformed with yeast expression vectors; insect cell systems 
infected with virus expression vectors (e.g., baculovirus); plant cell systems transformed 
with virus expression vectors (e.g., cauliflower mosaic virus (CaMV) or tobacco mosaic 
virus (TMV)) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. 
20 The invention is not limited by the host cell employed. 

The "control elements" or "regulatory sequences" are those non-translated regions, 
e.g., enhancers, promoters, and 5' and 3' untranslated regions, of the vector and 
polynucleotide sequences encoding TRANP which interact with host cellular proteins to 
carry out transcription and translation. Such elements may vary in their strength and 
25 specificity. Depending on the vector system and host utilized, any number of suitable 

transcription and translation elements, including constitutive and inducible promoters, may 
be used. For example, when cloning in bacterial systems, inducible promoters, e.g., hybrid 
lacZ promoter of the Bluescript® phagemid (Stratagene, La Jolla, CA) or pSportl™ 
plasmid (GlBCO/BRL), may be used. The baculovirus polyhedrin promoter may be used 
30 in insect cells. Promoters or enhancers derived from the genomes of plant cells (e.g., heat 
shock, RUBISCO, and storage protein genes) or from plant viruses (e.g., viral promoters 
or leader sequences) may be cloned into the vector. In mammalian cell systems, 
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promoters from mammalian genes or from mammalian viruses are preferable. If it is 
necessary to generate a cell line that contains multiple copies of the sequence encoding 
TRANP, vectors based on SV40 or EBV may be used with an appropriate selectable 
marker. 

5 In bacterial systems, a number of expression vectors may be selected depending 

upon the use intended for TRANP. For example, when large quantities of TRANP are 
needed for the induction of antibodies, vectors which direct high level expression of fusion 
proteins that are readily purified may be used. Such vectors include, but are not limited to, 
multifunctional E, coli cloning and expression vectors such as Bluescript® (Stratagene), 
10 in which the sequence encoding TRANP may be ligated into the vector in frame with 
sequences for the amino-terminal Met and the subsequent 7 residues of B^galactosidase so 
that a hybrid protein is produced. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. 
Biol. Chem. 264:5503-5509.) pGEX vectors (Pharmacia Biotech, Uppsala, Sweden) may 
also be used to express foreign polypeptides as fusion proteins with glutathione 
1 5 S-transferase (GST). In general, such fusion proteins are soluble and can easily be purified 
from lysed cells by adsorption to glutathione-agarose beads followed by elution in the 
presence of free glutathione. Proteins made in such systems may be designed to include 
heparin, thrombin, or factor XA protease cleavage sites so that the cloned polypeptide of 
interest can be released from the GST moiety at will. 
20 In the yeast Saccharomvces cerevjsiae, a number of vectors containing constitutive 

or inducible promoters, such as alpha factor, alcohol oxidase, and PGH, may be used. - 
(See, e.g., Ausubel, supra; and Grant et al. (1987) Methods Enzymol. 153:516-544.) 

In cases where plant expression vectors are used, the expression of sequences 
encoding TRANP may be driven by any of a number of promoters. For example, viral 
25 promoters such as the 35S and 1 9S promoters of CaMV may be used alone or in 

combination with the omega leader sequence from TMV. (Takamatsu, N. (1987) EMBO 
J. 6:307-31 1.) Alternatively, plant promoters such as the small subunit of RUBISCO or 
heat shock promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 
3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) 
30 Results Probl. Cell Differ. 17:85-105.) These constructs can be introduced into plant cells 
by direct DNA transformation or pathogen-mediated transfeetion. Such techniques are 
described in a number of generally available reviews. <See, e.g., Hobbs, S. or Murry, L.E. 
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in McGraw Hi|l Yearbook of Science and Technology (1 992) McGraw Hill, New York, 
NY; pp. 191-196.) 

An insect system may also be used to express TRANP. For example, in one such 
system > AutQRrapha cajjfornic a nuclear polyhedrosis virus (AcNPV) is used as a vector to 
5 express foreign genes in Spodopfera fru P ipe r da cells or in Trichonhisia larvae. The 
sequences encoding TRANP may be cloned into a non-essential region of the virus, such 
as the polyhedrin gene, and placed under control of the polyhedrin promoter. Successful 
insertion of sequences encoding TRANP will render the polyhedrin gene inactive and 
produce recombinant virus lacking coat protein. The recombinant viruses may then be 
10 used to infect, for example, S, frugiperda cells or TrichopI,,^ larvae in which TRANP 
may be expressed. (See, e.g., Engelhard, E.K. et al. (1994) Proc. Nat. Acad. Sci. 
91:3224-3227.) 

In mammalian host cells, a number of viral-based expression systems may be 
utilized. In cases where an adenovirus is used as an expression vector, sequences encoding 

1 5 TRANP may be Iigated into an adenovirus transcription/translation complex consisting of 
the late promoter and tripartite leader sequence. Insertion in a non-essential El or E3 
region of the viral genome may be used to obtain a viable virus which is capable of 
expressing TRANP in infected host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. 81:3655-3659.) In addition, transcription enhancers, such as the Rous 

20 sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host 
cells. 

Human artificial chromosomes (HACs) may also be employed to deliver larger 
fragments of DNA than can be contained and expressed in a plasmid. HACs of about 6 kb 
to 10 Mb are constructed and delivered via conventional delivery methods (liposomes, 
25 polycationic amino polymers, or vesicles) for therapeutic purposes. 

Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding TRANP. Such signals include the ATG initiation codon and adjacent 
sequences. In cases where sequences encoding TRANP and its initiation codon and 
upstream sequences are inserted into the appropriate expression vector, no additional 
30 transcriptional or translation^ control signals may be needed. However, in cases where 
only coding sequence, or a fragment thereof, is inserted, exogenous translation^ control 
signals including the ATG initiation codon should be provided. Furthermore, the initiation 
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codon should be in the correct reading frame to ensure translation of the entire insert. 
Exogenous translational elements and initiation codons may be of various origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the inclusion of 
enhancers appropriate for the particular cell system used. (See, e.g., Scharf, D. et al. 
5 (1994) Results Probl. Cell Differ. 20:125-162.) 

In addition, a host cell strain may be chosen for its ability to modulate expression 
of the inserted sequences or to process the expressed protein in the desired fashion. Such 
modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing 

10 which cleaves a "prepro" form of the protein may also be used to facilitate correct 
insertion, folding, and/or function. Different host cells which have specific cellular 
machinery and characteristic mechanisms for post-translational activities (e.g., CHO, 
HeLa, MDCK, HEK293, and WI38), are available from the American Type Culture 
Collection (ATCC, Bethesda, MD) and may be chosen to ensure the correct modification 

15 and processing of the foreign protein. 

For long term, high yield production of recombinant proteins, stable expression is 
preferred. For example, cell lines capable of stably expressing TRANP can be 
transformed using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate 

20 vector. Following the introduction of the vector, cells may be allowed to grow for about 1 
to 2 days in enriched media before being switched to selective media. The purpose of the 
selectable marker is to confer resistance to selection, and its presence allows growth and 
recovery of cells which successfully express the introduced sequences. Resistant clones of 
stably transformed cells may be proliferated using tissue culture techniques appropriate to 

25 the cell type. 

Any number of selection systems may be used to recover transformed cell lines. 
These include, but are not limited to, the herpes simplex virus thymidine kinase genes and 
adenine phosphoribosyltransferase genes, which can be employed in tk or apr oells, 
respectively. (See, e.g., Wigler, M. et al. (1 977) Cell 1 1 :223-232; and Lowy, I. et al. 
30 (1980) Cell 22:817-823) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; npt 
confers resistance to the aminoglycosides neomycin and G-41 8; and als or pat confer 
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resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 
Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-3570; Colbere-Garapin, F. et al 
(1981) J. Mol. Biol. 150:1-14; and Murry, supia.) Additional selectable genes have been 
described, e.g., trpB, which allows cells to utilize indole in place of tryptophan, or hisD, 
5 which allows cells to utilize histinol in place of histidine. (See, e.g., Hartman, S.C. and 
R.C. Mulligan (1988) Proc. Natl: Acad. Sci. 85:8047-8051.) Recently, the use of visible 
markers has gained popularity with such markers as anthocyanins, 6 glucuronidase and its 
substrate GUS, luciferase and its substrate luciferin. Green fluorescent proteins (GFP) 
(Clontech, Palo Alto, CA) are also used (See, e.g., Chalfie, M. et al. (1994) Science 
) 263:802-805.) These markers can be used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific 
vector system. (See, e.g., Rhodes, C.A. et al. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, the presence and expression of the gene may need to be confirmed. 
For example, if the sequence encoding TRANP is inserted within a marker gene sequence, 
transformed cells containing sequences encoding TRANP can be identified by the absence 
of marker gene function. Alternatively, a marker gene can be placed in tandem with a 
sequence encoding TRANP under the control of a single promoter. Expression of the 
marker gene in response to induction or selection usually indicates expression of the 
tandem gene as well. 

Alternatively, host cells which contain the nucleic acid sequence encoding TRANP 
and express TRANP may be identified by a variety of procedures known to those of skill 
in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA 
hybridizations and protein bioassay or immunoassay techniques which include membrane, 
solution, or chip based technologies for the detection and/or quantification of nucleic acid 
or protein sequences. 

The presence of polynucleotide sequences encoding TRANP can be detected by 
DNA-DNA or DNA-RNA hybridization or amplification using probes or fragments or 
fragments of polynucleotides encoding TRANP. Nucleic acid amplification based assays 
involve the use of oligonucleotides or oligomers based on the sequences encoding TRANP 
to detect transformants containing DNA or RNA encoding TRANP. 

A variety of protocols for detecting and measuring the expression of TRANP, 
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using either polyclonal or monoclonal antibodies specific for the protein, are known in the 
art. Examples of such techniques include enzyme-linked immunosorbent assays 
(ELISAs), radioimmunoassays (RJAs), and fluorescence activated cell sorting (FACS). A 
two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
5 non-interfering epitopes on TRANP is preferred, but a competitive binding assay may be 
employed. These and other assays are well described in the art. (See, e.g., Hampton, R. et 
al. (1990) Serological Methods, a Laboratory Manual , APS Press, St Paul, MN, Section 
IV; and Maddox, D.E. et al. (1983) J. Exp. Med. 158:121 1-1216). 

A wide variety of labels and conjugation techniques are known by those skilled in 

10 the art and may be used in various nucleic acid and amino acid assays. Means for 
producing labeled hybridization or PCR probes for detecting sequences related to 
polynucleotides encoding TRANP include oligolabeling, nick translation, end-labeling, or 
PCR amplification using a labeled nucleotide. Alternatively, the sequences encoding 
TRANP, or any fragments thereof, may be cloned into a vector for the production of an 

15 mRNA probe. Such vectors are known in the art, are commercially available, and may be 
used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using 
a variety of commercially available kits, such as those provided by Pharmacia & Upjohn 
(Kalamazoo, MI), Promega (Madison, WI), and U.S. Biochemical Corp. (Cleveland, OH). 

20 Suitable reporter molecules or labels which may be used for ease of detection include 
radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as 
substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TRANP may be 
cultured under conditions suitable for the expression and recovery of the protein from cell 

25 culture. The protein produced by a transformed cell may be secreted or contained 

intracellularly depending on the sequence and/or the vector used. As will be understood 
by those of skill in the art, expression vectors containing polynucleotides which encode 
TRANP may be designed to contain signal sequences which direct secretion of TRANP 
through a prokaryotic or eukaryotic cell membrane. Other constructions may be used to 

30 join sequences encoding TRANP to nucleotide sequences encoding a polypeptide domain 
which will facilitate purification of soluble proteins. Such purification facilitating 
domains include, but are not limited to, metal chelating peptides such as 
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histidine-tryptophan modules that allow purification on immobilized metals, protein A 
domains that allow purification on immobilized immunoglobulin, and the domain utilized 
in the FLAGS extension/affinity purification system (Immunex Corp., Seattle, WA). The 
inclusion of cleavable linker sequences, such as those specific for Factor XA or 
5 enterokinase (Invitrogen, Sari Diego, CA), between the purification domain and the 
TRANP encoding sequence may be used to facilitate purification. One such expression 
vector provides for expression of a fusion protein containing TRANP and a nucleic acid 
encoding 6 histidine residues preceding a thioredoxin or an enterokinase cleavage site. 
The histidine residues facilitate purification on immobilized metal ion affinity 

10 chromatography. (IMAC) (See, e.g., Porath, J. et al. (1992) Prot. Exp. Purif. 3: 263-281.) 
The enterokinase cleavage site provides a means for purifying TRANP from the fusion 
protein. (See, e.g., Kroll, D.J. et al. (1993) DNA Cell Biol. 12:441-453.) 

Fragments of TRANP may be produced not only by recombinant production, but 
also by direct peptide synthesis using solid-phase techniques. (See, e.g., Creighton, T.E. 

15 (1984) Protein: Structures and Molecular Properties, pp. 55-60, W.H. Freeman and Co., 
New York, NY.) Protein synthesis may be performed by manual techniques or by 
automation. Automated synthesis may be achieved, for example, using the Applied 
Biosystems 431 A Peptide Synthesizer (Perkin Elmer). Various fragments of TRANP may 
be synthesized separately and then combined to produce the full length molecule. 

20 

THERAPEUTICS 

Chemical and structural homology exists among the human transport-associated 
proteins of the invention. In addition, TRANP is expressed in cancer. Therefore, TRANP 
appears to play a role in cancer and transport disorders. In cancer and transport disorders 

25 where expression of TRANP is increased, or where expression of TRANP is promoting 
cancer and transport disorders, it is desirable to decrease the expression of TRANP. In 
transport disorders where expression of TRANP is decreased, it is desirable to provide the 
protein or increase the expression of TRANP. 

Therefore, in one embodiment, TRANP or a fragment or derivative thereof may be 

30 administered to a subject to treat or prevent a transport disorder associated with decreased 
expression or activity of TRANP. Such transport disorders include, but are not limited to, 
adrenoleukodystrophy, cystic fibrosis, cystinuria, cystine nephrolithiasis, Dubin- Johnson 
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syndrome, glucose-galactose malabsorption syndrome, diabetes mellitus, diabetes 
insipidus, diastrophic dysplasia, Dubin-Johnson syndrome, Fanconi disease, Fanconi- 
Bickel syndrome, Hartup disease, hyperbilirubinemia, hypercholesterolemia, 
hyperinsulinemia, hyper- and hypoglycemia, iminoglycinuria, Grave's disease, goiter, 
5 Cushing's disease, and Addison's disease, von Gierk disease, Zellweger syndrome; 
gastrointestinal disorders including ulcerative colitis, gastric and duodenal ulcers; other 
conditions associated with abnormal vesicle trafficking including AIDS; allergies 
including hay fever, asthma, and urticaria (hives); autoimmune hemolytic anemia, 
proliferative glomerulonephritis; inflammatory bowel disease, multidrug resistance, 
10 multiple sclerosis; myasthenia gravis, myocardial ischemia, rheumatoid and osteoarthritis, 
Parkinson's disease, Pendred syndrome, scleroderma, Bartter's and Gitelman's syndromes, 
Chediak-Higashi and Sjogren's syndromes, Stargardt's disease, systemic lupus 
erythematosus, and Wilson's disease. 

In another embodiment, a vector capable of expressing TRANP or a fragment or 
15 derivative thereof may be administered to a subject to treat or prevent a transport disorder 
including, but not limited to, those listed above. 

In a further embodiment, a pharmaceutical composition comprising a substantially 
purified TRANP in conjunction with a suitable pharmaceutical carrier may be 
administered to a subject to treat or prevent a transport disorder including, but not limited 
20 to, those provided above. 

In still another embodiment, an agonist which modulates the activity of TRANP 
may be administered to a subject to treat or prevent a transport disorder associated with 
increased expression or activity of TRANP. 

In one embodiment, an antagonist which modulates the activity of TRANP may be 
25 administered to a subject to treat or prevent a cancer associated with increased expression 
or activity of TRANP. Such cancers can include, but are not limited to, adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, 
cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, 
30 parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and 
uterus. In one aspect, an antibody which specifically binds TRANP may be used directly 
as an antagonist or indirectly as a targeting or delivery mechanism for bringing a 
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pharmaceutical agent to cells or tissue which express TRANP. 

In an additional embodiment, a vector expressing the complement of the 
polynucleotide encoding TRANP may be administered to a subject to treat or prevent a 
cancer including, but not limited to, those listed above. 
5 In a further embodiment, an antagonist which modulates the activity of TRANP 

may be administered to a subject to treat or prevent a transport disorder associated with 
increased expression or activity of TRANP. Such transport disorders include, but are not 
limited to, those listed above. In one aspect, an antibody which specifically binds TRANP 
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism 
10 for bringing a pharmaceutical agent to cells or tissue which express TRANP. 

In an additional embodiment, a vector expressing the complement of the 
polynucleotide encoding TRANP may be administered to a subject to treat or prevent a 
transport disorder including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, 
15 complementary sequences, or vectors of the invention may be administered in combination 
with other appropriate therapeutic agents. Selection of the appropriate agents for use in 
combination therapy may be made by one of ordinary skill in the art, according to 
conventional pharmaceutical principles. The combination of therapeutic agents may act 
synergistically to effect the treatment or prevention of the various disorders described 
20 above. Using this approach, one may be able to achieve therapeutic efficacy with lower 
dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of TRANP may be produced using methods which are generally 
known in the art. In particular, purified TRANP may be used to produce antibodies or to 
screen libraries of pharmaceutical agents to identify those which specifically bind TRANP. 
25 Antibodies to TRANP may also be generated using methods that are well known in the art. 
Such antibodies may include, but are not limited to, polyclonal, monoclonal, chimeric, and 
single chain antibodies, Fab fragments, and fragments produced by a Fab expression 
library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are especially 
preferred for therapeutic use. 
30 For the production of antibodies, various hosts including goats, rabbits, rats, mice, 

humans, and others may be immunized by injection with TRANP or with any fragment or 
oligopeptide thereof which has immunogenic properties. Depending on the host species, 
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various adjuvants may be used to increase immunological response. Such adjuvants 
include, but are not limited to, Freund's, mineral gels such as aluminum hydroxide, and 
surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG (bacilli 
5 Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce 
antibodies to TRANP have an amino acid sequence consisting of at least about 5 amino 
acids, and, more preferably, of at least about 10 amino acids. It is also preferable that 
these oligopeptides, peptides, or fragments are identical to a portion of the amino acid 
10 sequence of the natural protein and contain the entire amino acid sequence of a small, 
naturally occurring molecule. Short stretches of TRANP amino acids may be fused with 
those of another protein, such as KLH, and antibodies to the chimeric molecule may be 
produced. 

Monoclonal antibodies to TRANP may be prepared using any technique which 

15 provides for the production of antibody molecules by continuous cell lines in culture. 
These include, but are not limited to, the hybridoma technique, the human B-cell 
hybridoma technique, and the EBV-hybridoma technique. (See, e.g., Kohler, G. et at. 
(1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods 81:31-42; Cote, 
RJ. et al. (1983) Proc. Natl. Acad. Sci. 80:2026-2030; and Cole, S.P. et al, (1984) Mol. 

20 Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such 
as the splicing of mouse antibody genes to human antibody genes to obtain a molecule 
with appropriate antigen specificity and biological activity, can be used. (See, e.g., 
Morrison, S.L. et al. (1984) Proc. Natl. Acad. Sci. 81:6851-6855; Neuberger, M.S. et al. 

25 (1984) Nature 312:604-608; and Takeda, S. et al. (1985) Nature 314:452-454.) 

Alternatively, techniques described for the production of single chain antibodies may be 
adapted, using methods known in the art, to produce TRANP-specific single chain 
antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may 
be generated by chain shuffling from random combinatorial immunoglobulin libraries. 

30 (See, e.g., Burton D.R. (1991) Proc. Natl. Acad. Sci. 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening immunoglobulin libraries or panels of highly 
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specific binding reagents as disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) 
Proc. Natl. Acad. Sci. 86: 3833-3837; and Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for TRANP may also be 
generated. For example, such fragments include, but are not limited to, F(ab')2 fragments 
5 produced by pepsin digestion of the antibody molecule and Fab fragments generated by 
reducing the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression 
libraries may be constructed to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. (See, e.g., Huse, W.D. et al. (1989) Science 
246:1275-1281.) 

> Various immunoassays may be used for screening to identify antibodies having the 

desired specificity. Numerous protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies with established specificities are 
well known in the art. Such immunoassays typically involve the measurement of complex 
formation between TRANP and its specific antibody. A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two non-interfering TRANP 
epitopes is preferred, but a competitive binding assay may also be employed. (Maddox, 
supra .) 

In another embodiment of the invention, the polynucleotides encoding TRANP, or 
any fragment or complement thereof, may be used for therapeutic purposes. In one aspect, 
the complement of the polynucleotide encoding TRANP may be used in situations in 
which it would be desirable to block the transcription of the mRNA. In particular, cells 
may be transformed with sequences complementary to polynucleotides encoding TRANP. 
Thus, complementary molecules or fragments may be used to modulate TRANP activity, 
or to achieve regulation of gene function. Such technology is now well known in the art, 
and sense or antisense oligonucleotides or larger fragments can be designed from various 
locations along the coding or control regions of sequences encoding TRANP. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia 
viruses, or from various bacterial plasmids, may be used for delivery of nucleotide 
sequences to the targeted organ, tissue, or cell population. Methods which are well known 
to those skilled in the art can be used to construct vectors which will express nucleic acid 
sequences complementary to the polynucleotides of the gene encoding TRANP. (See, e.g., 
Sambrook, supra; and Ausubel, supra .) 
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Genes encoding TRANP can be turned off by transforming a cell or tissue with 
expression vectors which express high levels of a polynucleotide, or fragment thereof, 
encoding TRANP. Such constructs may be used to introduce untranslatable sense or 
antisense sequences into a cell. Even in the absence of integration into the DNA, such 
5 vectors may continue to transcribe RNA molecules until they are disabled by endogenous 
nucleases. Transient expression may last for a month or more with a non-replicating 
vector, and may last even longer if appropriate replication elements are part of the vector 
system. 

As mentioned above, modifications of gene expression can be obtained by 

10 designing complementary sequences or antisense molecules (DNA, RNA, or PNA) to the 
control, 5\ or regulatory regions of the gene encoding TRANP. Oligonucleotides derived 
from the transcription initiation site, e.g., between about positions -10 and +10 from the 
start site, are preferred. Similarly, inhibition can be achieved using triple helix 
base-pairing methodology. Triple helix pairing is useful because it causes inhibition of the 

15 ability of the double helix to open sufficiently for the binding of polymerases, transcription 
factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g., Gee, J.E. et al (1994) in Hiiber, B.E. and B.I. 
Carr, Molecular and Immunologic Approaches- Futura Publishing Co., Mt. Kisco, NY, pp. 
1 63-1 77.) A complementary sequence or antisense molecule may also be designed to 

20 block translation of mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific 
cleavage of RNA. The mechanism of ribozyme action involves sequence-specific 
hybridization of the ribozyme molecule to complementary target RNA, followed by 
endonucleolytic cleavage. For example, engineered hammerhead motif ribozyme 

25 molecules may specifically and efficiently catalyze endonucleolytic cleavage of sequences 
encoding TRANP. 

Specific ribozyme cleavage sites within any potential RNA target are initially 
identified by scanning the target molecule for ribozyme cleavage sites, including the 
following sequences: GUA, GUU, and GUC. Once identified, short RNA sequences of 
30 between 1 5 and 20 ribonucleotides, corresponding to the region of the target gene 

containing the cleavage site, may be evaluated for secondary structural features which may 
render the oligonucleotide inoperable. The suitability of candidate targets may also be 
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evaluated by testing accessibility to hybridization with complementary oligonucleotides 
using ribonuciease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be 
prepared by any method known in the art for the synthesis of nucleic acid molecules. 
5 These include techniques for chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by 

in vitro and in vivo transcription of DNA sequences encoding TRANP. Such DNA 
sequences may be incorporated into a wide variety of vectors with suitable RNA 
polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 

10 synthesize complementary RNA, constitutively or inducibly, can be introduced into cell 
lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. 
Possible modifications include, but are not limited to, the addition of flanking sequences at 
the 5' and/or 3* ends of the molecule, or the use of phosphorothioate or T O-methyl rather 

15 than phosphodiesterase linkages within the backbone of the molecule. This concept is 
inherent in the production of PNAs and can be extended in all of these molecules by the 
inclusion of nontraditional bases such as inosine, queosine, and wybutosine, as well as 
acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, 
thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

20 Many methods for introducing vectors into cells or tissues are available and 

equally suitable for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be 
introduced into stem cells taken from the patient and clonally propagated for autologous 
transplant back into that same patient. Delivery by transfection, by liposome injections, or 
by polycationic amino polymers may be achieved using methods which are well known in 

25 the art: (See, e.g., Goldman, CK. et al. (1997) Nature Biotechnology 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in 
need of such therapy, including, for example, mammals such as dogs, cats, cows, horses, 
rabbits, monkeys, and most preferably, humans. 

An additional embodiment of the invention relates to the administration of a 

30 pharmaceutical or sterile composition, in conjunction with a pharmaceutical ly acceptable 
carrier, for any of the therapeutic effects discussed above. Such pharmaceutical 
compositions may consist of TRANP, antibodies to TRANP, and mimetics, agonists, 
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antagonists, or inhibitors of TRANP. The compositions may be administered alone or in 
combination with at least one other agent, such as a stabilizing compound, which may be 
administered in any sterile, biocompatible pharmaceutical carrier including, but not limited 
to, saline, buffered saline, dextrose, and water. The compositions may be administered to a 
5 patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by 
any number of routes including, but not limited to, oral, intravenous, intramuscular, 
intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, 
intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means. 

10 In addition to the active ingredients, these pharmaceutical compositions may 

contain suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries 
which facilitate processing of the active compounds into preparations which can be used 
pharmaceutical^. Further details on techniques for formulation and administration may 
be found in the latest edition of Remington's Pharmaceutical Sciences (Maack Publishing 

15 Co., Easton, PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as 
tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for 

20 ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active 
compounds with solid excipient and processing the resultant mixture of granules 
(optionally, after grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be 
added, if desired. Suitable excipients include carbohydrate or protein fillers, such as 

25 sugars, including lactose, sucrose, mannitol, and sorbitol; starch from corn, wheat, rice, 
potato, or other plants; cellulose, such as methyl cellulose, 

hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including 
arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating 
or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, 
30 and alginic acid or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as 
concentrated sugar solutions, which may also contain gum arabic, talc, 
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polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 
solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may 
be added to the tablets or dragee coatings for product identification or to characterize the 
quantity of active compound, i.e., dosage. 
5 Pharmaceutical preparations which can be used orally include push-fit capsules 

made of gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as 
glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed with fillers or 
binders, such as lactose or starches, lubricants, such as talc or magnesium stearate, and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or 

10 suspended in suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with 
or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be 
formulated in aqueous solutions, preferably in physiologically compatible buffers such as 
Hanks's solution, Ringer's solution, or physiologically buffered saline. Aqueous injection 

15 suspensions may contain substances which increase the viscosity of the suspension, such 
as sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the 
active compounds may be prepared as appropriate oily injection suspensions. Suitable 
lipophilic solvents or vehicles include fatty oils, such as sesame oil, or synthetic fatty acid 
esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic amino 
20 polymers may also be used for delivery. Optionally, the suspension may also contain 
suitable stabilizers or agents to increase the solubility of the compounds and allow for the 
preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier 
to be permeated are used in the formulation. Such penetrants are generally known in the 
25 art. 

The pharmaceutical compositions of the present invention may be manufactured in 
a manner that is known in the art, e.g., by means of conventional mixing, dissolving, 
granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or 
lyophilizing processes. 

30 The pharmaceutical composition may be provided as a salt and can be formed with 

many acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, 
malic, and succinic acid. Salts tend to be more soluble in aqueous or other protonic 
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solvents than are the corresponding free base forms. In other cases, the preferred 
preparation may be a lyophilized powder which may contain any or all of the following: 1 
mM to 50 mM histidine, 0.1% to 2% sucrose, and 2% to 7% mannitol, at a pH range of 4.5 
to 5.5, that is combined with buffer prior to use. 
5 After pharmaceutical compositions have been prepared, they can be placed in an 

appropriate container and labeled for treatment of an indicated condition. For 
administration of TRANP, such labeling would include amount, frequency, and method of 
administration. 

Pharmaceutical compositions suitable for use in the invention include compositions 
10 wherein the active ingredients are contained in an effective amount to achieve the intended 
purpose. The determination of an effective dose is well within the capability of those 
skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially 
either in cell culture assays, e.g., of neoplastic cells or in animal models such as mice, rats, 

15 rabbits, dogs, or pigs. An animal model may also be used to determine the appropriate 
concentration range and route of administration. Such information can then be used to 
determine useful doses and routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for 
example TRANP or fragments thereof, antibodies of TRANP, and agonists, antagonists or 

20 inhibitors of TRANP, which ameliorates the symptoms or condition. Therapeutic efficacy 
and toxicity may be determined by standard pharmaceutical procedures in cell cultures or 
with experimental animals, such as by calculating the ED50 (the dose therapeutically 
effective in 50% of the population) or LD50 (the dose lethal to 50% of the population) 
statistics. The dose ratio of therapeutic to toxic effects is the therapeutic index, and it can 

25 be expressed as the ED50/LD50 ratio. Pharmaceutical compositions which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal 
studies are used to formulate a range of dosage for human use. The dosage contained in 
such compositions is preferably within a range of circulating concentrations that includes 
the ED50 with little or no toxicity. The dosage varies within this range depending upon 

30 the dosage form employed, the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related 
to the subject requiring treatment. Dosage and administration are adjusted to provide 
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sufficient levels of the active moiety or to maintain the desired effect. Factors which may 
be taken into account include the severity of the disease state, the general health of the 
subject, the age, weight, and gender of the subject, time and frequency of administration, 
drug combination(s), reaction sensitivities, and response to therapy. Long-acting 
5 pharmaceutical compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0. 1 ^g to 100,000 /ig, up to a total 
dose of about 1 gram, depending upon the route of administration. Guidance as to 
particular dosages and methods of delivery is provided in the literature and generally 
10 available to practitioners in the art. Those skilled in the art will employ different 

formulations for nucleotides than for proteins or their inhibitors. Similarly, delivery of 
polynucleotides or polypeptides will be specific to particular cells, conditions, locations, 
etc. 

15 DIAGNOSTICS 

In another embodiment, antibodies which specifically bind TRANP may be used 
for the diagnosis of disorders characterized by expression of TRANP, or in assays to 
monitor patients being treated with TRANP or agonists, antagonists, or inhibitors of 
TRANP. Antibodies useful for diagnostic purposes may be prepared in the same manner 

20 as described above for therapeutics. Diagnostic assays for TRANP include methods which 
utilize the antibody and a label to detect TRANP in human body fluids or in extracts of 
cells or tissues. The antibodies may be used with or without modification, and may be 
labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be 

25 used. 

A variety of protocols for measuring TRANP, including ELISAs, RIAs, and 
FACS, are known in the art and provide a basis for diagnosing altered or abnormal levels 
of TRANP expression. Normal or standard values for TRANP expression are established 
by combining body fluids or cell extracts taken from normal mammalian subjects, 
30 preferably human, with antibody to TRANP under conditions suitable for complex 
formation The amount of standard complex formation may be quantitated by various 
methods, preferably by photometric means. Quantities of TRANP expressed in subject, 
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control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing 
disease. 

In another embodiment of the invention, the polynucleotides encoding TRANP 
5 may be used for diagnostic purposes. The polynucleotides which may be used include 
oligonucleotide sequences, complementary RNA and DNA molecules, and PNAs. The 
polynucleotides may be used to detect and quantitate gene expression in biopsied tissues in 
which expression of TRANP may be correlated with disease. The diagnostic assay may be 
used to determine absence, presence, and excess expression of TRANP, and to monitor 

10 regulation of TRANP levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding TRANP or closely 
related molecules may be used to identify nucleic acid sequences which encode TRANP. 
The specificity of the probe, whether it is made from a highly specific region, e.g., the 5* 

15 regulatory region, or from a less specific region, e.g., a conserved motif, and the 

stringency of the hybridization or amplification (maximal, high, intermediate, or low), will 
determine whether the probe identifies only naturally occurring sequences encoding 
TRANP, alleles, or related sequences. 

Probes may also be used for the detection of related sequences, and should 

20 preferably contain at least 50% of the nucleotides from any of the TRANP encoding 
sequences. The hybridization probes of the subject invention may be DNA or RNA and 
may be derived from the sequence of SEQ ID NO: 10, SEQ ID NO: 1 1, SEQ ID NO: 12, 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, or SEQ 
ID NO: 10, or from genomic sequences including promoters, enhancers, and introns of the 

25 TRANP gene. 

Means for producing specific hybridization probes for DNAs encoding TRANP 
include the cloning of polynucleotide sequences encoding TRANP or TRANP derivatives 
into vectors for the production of mRNA probes. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by means of 
30 the addition of the appropriate RNA polymerases and the appropriate labeled nucleotides. 
Hybridization probes may be labeled by a variety of reporter groups, for example, by 
radionuclides such as 32 P or 35 S, or by enzymatic labels, such as alkaline phosphatase 
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coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding TRANP may be used for the diagnosis of a 
disorder associated with expression of TRANP. Examples of such a disorder include, but 
are not limited to, transport disorders such as adrenoleukodystrophy, cystic fibrosis, 
5 cystinuria, cystine nephrolithiasis, Dubin- Johnson syndrome, glucose-galactose 
malabsorption syndrome, diabetes mellitus, diabetes insipidus, diastrophic dysplasia, 
Dubin- Johnson syndrome, Fanconi disease, Fanconi-Bickel syndrome, Hartup disease, 
hyperbilirubinemia, hypercholesterolemia, hyperinsulinemia, hyper- and hypoglycemia, 
iminoglycinuria, Grave's disease, goiter, Cushing's disease, and Addison's disease, von 
10 Gierk disease, Zellweger syndrome; gastrointestinal disorders including ulcerative colitis, 
gastric and duodenal ulcers; other conditions associated with abnormal vesicle trafficking 
including AIDS; allergies including hay fever, asthma, and urticaria (hives); autoimmune 
hemolytic anemia, proliferative glomerulonephritis; inflammatory bowel disease, 
multidrug resistance, multiple sclerosis; myasthenia gravis, myocardial ischemia, 
15 rheumatoid and osteoarthritis, Parkinson's disease, Pendred syndrome, scleroderma, 
Banter's and Gitelman's syndromes, Chediak-Higashi and Sjogren's syndromes, 
Stargardt's disease, systemic lupus erythematosus, and Wilson's disease; and cancer such 
as adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, 
teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone 
20 marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, 
liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, 
spleen, testis, thymus, thyroid, and uterus. The polynucleotide sequences encoding 
TRANP may be used in Southern or northern analysis, dot blot, or other membrane-based 
technologies; in PCR technologies; in dipstick, pin, and ELISA assays; and in microarrays 
25 utilizing fluids or tissues from patients to detect altered TRANP expression. Such 
qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding TRANP may be useful in 
assays that detect the presence of associated disorders, particularly those mentioned above. 
The nucleotide sequences encoding TRANP may be labeled by standard methods and 
30 added to a fluid or tissue sample from a patient under conditions suitable for the formation 
of hybridization complexes. After a suitable incubation period, the sample is washed and 
the signal is quantitated and compared with a standard value. If the amount of signal in 
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the patient sample is significantly altered in comparison to a control sample then the 
presence of altered levels of nucleotide sequences encoding TRANP in the sample 
indicates the presence of the associated disorder. Such assays may also be used to 
evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in 
5 clinical trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with 
expression of TRANP, a normal or standard profile for expression is established. This 
may be accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with a sequence, or a fragment thereof, encoding 
10 TRANP, under conditions suitable for hybridization or amplification. Standard 

hybridization may be quantified by comparing the values obtained from normal subjects 
with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this manner may be compared with 
values obtained from samples from patients who are symptomatic for a disorder. 
15 Deviation from standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of 
expression in the patient begins to approximate that which is observed in the normal 
subject. The results obtained from successive assays may be used to show the efficacy of 
20 treatment over a period ranging from several days to months. 

With respect to cancer, the presence of a relatively high amount of transcript in 
biopsied tissue from an individual may indicate a predisposition for the development of 
the disease, or may provide a means for detecting the disease prior to the appearance of 
actual clinical symptoms. A more definitive diagnosis of this type may allow health 
25 professionals to employ preventative measures or aggressive treatment earlier thereby 
preventing the development or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences 
encoding TRANP may involve the use of PGR. These oligomers may be chemically 
synthesized, generated enzymatically, or produced in vitro . Oligomers will preferably 
30 contain a fragment of a polynucleotide encoding TRANP, or a fragment of a 

polynucleotide complementary to the polynucleotide encoding TRANP, and will be 
employed under optimized conditions for identification of a specific gene or condition. 
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Oligomers may also be employed under less stringent conditions for detection or 
quantitation of closely related DNA or RNA sequences. 

Methods which may also be used to quantitate the expression of TRANP include 
radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and 
5 interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1 993) J. 

Immunol. Methods 159:235-244; and Duplaa, C. et al. (1993) Anal. Biochem. 229-236.) 
The speed of quantitation of multiple samples may be accelerated by running the assay in 
an ELISA format where the oligomer of interest is presented in various dilutions and a 
spectrophotometric or colorimetric response gives rapid quantitation. 

1 0 In further embodiments, oligonucleotides or longer fragments derived from any of 

the polynucleotide sequences described herein may be used as targets in a microarray. The 
microarray can be used to monitor the expression level of large numbers of genes 
simultaneously and to identify genetic variants, mutations, and polymorphisms. This 
information may be used to determine gene function, to understand the genetic basis of a 

1 5 disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic 
agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art. 
(See, e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) 
Proc. Natl. Acad. Sci. 93:10614-10619; Baldeschweiler et al. (1995) PCT application 
20 W095/251 1 16; Shalon, D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. 
(1997) Proc. Natl. Acad. Sci. 94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 
5,605,662.) 

In another embodiment of the invention, nucleic acid sequences encoding TRANP 
may be used to generate hybridization probes useful in mapping the naturally occurring 

25 genomic sequence. The sequences may be mapped to a particular chromosome, to a 
specific region of a chromosome, or to artificial chromosome constructions, e.g., human 
artificial chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial 
chromosomes (BACs), bacterial PI constructions, or single chromosome cDNA libraries. 
(See, e.g., Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 

30 7:149-154.) 

Fluorescent in_siiu hybridization (FISH) may be correlated with other physical 
chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. 
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(1995) in Meyers, R.A. (ed.) Molecular Biology and Biotechnology . VCH Publishers New 
York, NY, pp. 965-968.) Examples of genetic map data can be found in various scientific 
journals or at the Online Mendelian Inheritance in Man (OMIM) site. Correlation between 
the location of the gene encoding TRANP on a physical chromosomal map and a specific 
5 disorder, or a predisposition to a specific disorder, may help define the region of DNA 
associated with that disorder. The nucleotide sequences of the invention may be used to 
detect differences in gene sequences among normal, carrier, and affected individuals. 

In situ hybridization of chromosomal preparations and physical mapping 
techniques, such as linkage analysis using established chromosomal markers, may be used 

10 for extending genetic maps. Often the placement of a gene on the chromosome of another 
mammalian species, such as mouse, may reveal associated markers even if the number or 
arm of a particular human chromosome is not known. New sequences can be assigned to 
chromosomal arms by physical mapping. This provides valuable information to 
investigators searching for disease genes using positional cloning or other gene discovery 

15 techniques. Once the disease or syndrome has been crudely localized by genetic linkage to 
a particular genomic region, e.g., AT to 1 1 q22-23, any sequences mapping to that area 
may represent associated or regulatory genes for further investigation. (See, e.g., Gatti, 
R.A. et al. (1 988) Nature 336:577-580.) The nucleotide sequence of the subject invention 
may also be used to detect differences in the chromosomal location due to translocation, 

20 inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, TRANP, its catalytic or immunogenic 
fragments, or oligopeptides thereof can be used for screening libraries of compounds in 
any of a variety of drug screening techniques. The fragment employed in such screening 
may be free in solution, affixed to a solid support, borne on a cell surface, or located 

25 intracellularly. The formation of binding complexes between TRANP and the agent being 
tested may be measured. 

Another technique for drug screening provides for high throughput screening of 
compounds having suitable binding affinity to the protein of interest. (See, e.g., Geysen, 
et al. (1984) PCT application WO84/03564.) In this method, large numbers of different 

30 small test compounds are synthesized on a solid substrate, such as plastic pins or some 
other surface. The test compounds are reacted with TRANP, or fragments thereof, and 
washed. Bound TRANP is then detected by methods well known in the art. Purified 



WO 99/41373 



PCT/US99/02527 



TRANP can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to capture the 
peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which 
5 neutralizing antibodies capable of binding TRANP specifically compete with a test 
compound for binding TRANP. In this manner, antibodies can be used to detect the 
presence of any peptide which shares one or more antigenic determinants with TRANP. 

In additional embodiments, the nucleotide sequences which encode TRANP may 
be used in any molecular biology techniques that have yet to be developed, provided the 
10 new techniques rely on properties of nucleotide sequences that are currently known, 
including, but not limited to, such properties as the triplet genetic code and specific base 
pair interactions. 

The examples below are provided to illustrate the subject invention and are not 
included for the purpose of limiting the invention. 

15 

EXAMPLES 

For purposes of example, the preparation and sequencing of the PROSTUT10 
cDNA library, from which Incyte Clone 1689731 was isolated, is described below. 
Preparation and sequencing of cDNAs in libraries in the LIFESEQ 7 ^ database have varied 
20 over time, and the gradual changes involved use of kits, plasmids, and machinery available 
at the time the library was made and analyzed. 

I. PROSTUT10 cDNA Library Construction 

The PROSTUT10 cDNA library was constructed from prostate tumor tissue 
25 obtained from a 66-year-old Caucasian male. The tissue was excised when the patient 
underwent a radical prostatectomy and regional lymph node excision. The pathology 
report indicated a prostate tumor Gleason grade (2+3) involving the left and right side 
centrally. The tumor was confined and did not involve the capsule. Perineural invasion 
was absent. Initially, the patient presented with elevated prostate specific antigen (PSA). 
30 The patient history included benign hypertension and alcohol use. The patient's family 
history included a malignant neoplasm of the prostate and a malignant neoplasm of bone 
and articular cartilage in the patient's father, and benign hypertension the patient's sibling. 
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The frozen tissue was homogenized and lysed using a Brinkmann Homogenizer 
Polytron PT-3000 (Brinkmann Instruments, Westbury, NJ) in guanidinium isothiocyanate 
solution. The lysate was centrifuged over a 5.7 M CsCl cushion using an Beckman S W28 
rotor in a Beckman L8-70M Ultracentrifuge (Beckman Instruments) for 18 hours at 25,000 
5 rpm at ambient temperature. The RNA was extracted with acid phenol pH 4.7, 
precipitated using 0.3 M sodium acetate and 2.5 volumes of ethanol, resuspended in 
RNAse-free water, and DNase treated at 37°C. RNA extraction and precipitation were 
repeated as before. The mRNA was isolated with the Qiagen Oligotex kit (QIAGEN, Inc., 
Chatsworth, CA) and used to construct the cDN A libraries. 

10 The mRNA was handled according to the recommended protocols in the 

Superscript Plasmid System (Cat. #18248-013, Gibco/BRL, Gaithersburg, MD). 
PROSTUT10 cDNAs were fractionated on a Sepharose CL4B column (Cat. #275105-01, 
Pharmacia), and those cDNAs exceeding 400 bp were ligated into pINCY. The plasmid 
pINCY was subsequently transformed into DH5a™ competent cells (Cat. #18258-012, 

15 Gibco/BRL). 

II Isolation and Sequencing of cDN A Clones 

Plasmid DNA was released from the cells and purified using the REAL Prep 96 
Plasmid Kit (Catalog #26173, QIAGEN, Inc.). The recommended protocol was employed 

20 except for the following changes: 1) the bacteria were cultured in 1 ml of sterile Terrific 
Broth (Catalog #2271 1, Gibco/BRL) with carbenicillin at 25 mg/L and glycerol at 0.4%; 
2) after inoculation, the cultures were incubated for 19 hours and at the end of incubation, 
the cells were lysed with 0.3 ml of lysis buffer; and 3) following isopropanol precipitation, 
the plasmid DNA pellet was resuspended in 0.1 ml of distilled water. After the last step in 

25 the protocol, samples were transferred to a 96- well block for storage at 4° C. 

The cDNAs were sequenced by the method of Sanger et al. (1975, J. Mol. Biol. 
94:44 If), using a Hamilton Micro Lab 2200 (Hamilton, Reno, NV) in combination with 
Peltier Thermal Cyclers (PTC200 from MJ Research, Watertown, MA) and Applied 
Biosystems 377 DNA Sequencing Systems; and the reading frame was determined. 



III. Homology Searching of cDNA Clones and Their Deduced Proteins 

The nucleotide sequences and/or amino acid sequences of the Sequence Listing 
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were used to query sequences in the GenBank, SwissProt, BLOCKS, and Pima II 
databases. These databases, which contain previously identified and annotated sequences, 
were searched for regions of homology using BLAST (Basic Local Alignment Search 
Tool). (See, e.g., Altschul, S.F. (1993) J. Mol. Evol 36:290-300; and Altschul et al. (1990) 
5 J. Mol. Biol. 215:403-410.) 

BLAST produced alignments of both nucleotide and amino acid sequences to 
determine sequence similarity. Because of the local nature of the alignments, BLAST was 
especially useful in determining exact matches or in identifying homologs which may be 
of prokaryotic (bacterial) or eukaryotic (animal, fungal, or plant) origin. Other algorithms 
10 could have been used when dealing with primary sequence patterns and secondary 

structure gap penalties. (See, e.g., Smith, T. et al. (1992) Protein Engineering 5:35-51.) 
The sequences disclosed in this application have lengths of at least 49 nucleotides and 
have no more than 12% uncalled bases (where N is recorded rather than A, C, G, or T). 

The BLAST approach searched for matches between a query sequence and a 
15 database sequence. BLAST evaluated the statistical significance of any matches found, 
and reported only those matches that satisfy the user-selected threshold of significance. In 
this application, threshold was set at 10* 25 for nucleotides and 10* 8 for peptides. 

Incyte nucleotide sequences were searched against the GenBank databases for 
primate (pri), rodent (rod), and other mammalian sequences (mam), and deduced amino 
20 acid sequences from the same clones were then searched against GenBank functional 
protein databases, mammalian (mamp), vertebrate (vrtp), and eukaryote (eukp), for 
homology. 

Additionally, sequences identified from cDNA libraries may be analyzed to 
identify those gene sequences encoding conserved protein motifs using an appropriate 

25 analysis program, e.g., the Block 2 Bioanalysis Program (Incyte, Palo Alto, CA). This 
motif analysis program, based on sequence information contained in the Swiss-Prot 
Database and PROSITE, is a method of determining the function of uncharacterized 
proteins translated from genomic or cDNA sequences. (See, e.g., Bairoch, A. et al. (1997) 
Nucleic Acids Res. 25:217-221; and Attwood, T. K. et al. (1997) J. Chem. Inf. Comput. 

30 Sci. 37:417-424.) PROSITE may be used to identify common functional or structural 

domains in divergent proteins. The method is based on weight matrices. Motifs identified 
by this method are then calibrated against the SWISS-PROT database in order to obtain a 
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measure of the chance distribution of the matches. 

In another alternative, Hidden Markov models (HMMs) may be used to find 
protein domains, each defined by a dataset of proteins known to have a common biological 
function. (See, e.g., Pearson, W.R. and D.J. Lipman (1988) Proc. Natl. Acad. Sci. 
5 85:2444-2448; and Smith, T.F. and M.S. Waterman (1981) J. Mol. Biol. 147:195-197.) 
HMMs were initially developed to examine speech recognition patterns, but are now being 
used in a biological context to analyze protein and nucleic acid sequences as well as to 
model protein structure. (See, e.g., Krogh, A. et al. (1994) J. Mol. Biol. 235:1501-1531; 
and Collin, M. et al. (1993) Protein Sci. 2:305-314.) HMMs have a formal probabilistic 
10 basis and use position-specific scores for amino acids or nucleotides. The algorithm 

continues to incorporate information from newly identified sequences to increase its motif 
analysis capabilities. 

IV. Northern Analysis 

15 Northern analysis is a laboratory technique used to detect the presence of a 

transcript of a gene and involves the hybridization of a labeled nucleotide sequence to a 
membrane on which RNAs from a particular cell type or tissue have been bound. {See, 
e.g., Sambrook, supra , ch. 7; and Ausubel, F.M. et al. supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST are used to search for identical 

20 or related molecules in nucleotide databases such as GenBank or LIFESEQ™ database 
(Incyte Pharmaceuticals). This analysis is much faster than multiple membrane-based 
hybridizations. In addition, the sensitivity of the computer search can be modified to 
determine whether any particular match is categorized as exact or homologous. 
The basis of the search is the product score, which is defined as: 

25 

% sequence identity x % maximum BLAST score 

100 

The product score takes into account both the degree of similarity between two sequences 
and the length of the sequence match. For example, with a product score of 40, the match 
30 will be exact within a 1% to 2% error, and, with a product score of 70, the match will be 
exact. Homologous molecules are usually identified by selecting those which show 
product scores between 1 5 and 40, although lower scores may identify related molecules. 
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The results of northern analysis are reported as a list of libraries in which the 
transcript encoding TRANP occurs. Abundance and percent abundance are also reported. 
Abundance directly reflects the number of times a particular transcript is represented in a 
cDNA library, and percent abundance is abundance divided by the total number of 
5 sequences examined in the cDNA library. 

V. Extension of TRANP Encoding Polynucleotides 

The sequence of one of the polynucleotides of the present invention was used to 
design oligonucleotide primers for extending a partial nucleotide sequence to full length. 
1 0 One primer was synthesized to initiate extension of an antisense polynucleotide, and the 
other was synthesized to initiate extension of a sense polynucleotide. Primers were used 
to facilitate the extension of the known sequence "outward" generating amplicons 
containing new unknown nucleotide sequence for the region of interest. The initial primers 
were designed from the cDNA using OLIGO 4.06 (National Biosciences, Plymouth, MN), 
15 or another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC 
content of about 50% or more, and to anneal to the target sequence at temperatures of 
about 68°C to about 72°C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries (GlBCO/BRL) were used to extend the sequence. 
20 If more than one extension is necessary or desired, additional sets of primers are designed 
to further extend the known region. 

High fidelity amplification was obtained by following the instructions for the XL- 
PCR kit (Perkin Elmer) and thoroughly mixing the enzyme and reaction mix. PCR was 
performed using the Peltier Thermal Cycler (PTC200; M.J. Research, Watertown, MA), 
25 beginning with 40 pmol of each primer and the recommended concentrations of all other 
components of the kit, with the following parameters: 

Step 1 94° C for 1 min (initial denaturation) 

Step 2 65° C fori min 

Step 3 68° C for 6 min 

30 Step 4 94° C for 15 sec 

Step 5 65° C for 1 min 

Step 6 68° C for 7 min 

Ste P 7 Repeat steps 4 through 6 for an additional 1 5 cycles 

Step 8 94° C for 15 sec 
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Step 9 65° C for i min 

Step 10 68° C for 7:15 min 

Step 1 1 Repeat steps 8 through 10 for an additional 12 cycles 

Step 12 72° C for 8 min 

5 Step 1 3 4 ° C (and holding) 

A 5 fA to 10 /J aliquot of the reaction mixture was analyzed by electrophoresis on 
a low concentration (about 0.6% to 0.8%) agarose mini-gel to determine which reactions 
were successful in extending the sequence. Bands thought to contain the largest products 

10 were excised from the gel, purified using QIAQuick™ (QIAGEN Inc., Chatsworth, CA), 
and trimmed of overhangs using Klenow enzyme to facilitate religation and cloning. 

After ethanol precipitation, the products were redissolved in 13 m1 of ligation 
buffer, 1/^1 T4-DNA ligase (15 units) and \fA T4 polynucleotide kinase were added, and 
the mixture was incubated at room temperature for 2 to 3 hours, or overnight at 16° C. 

15 Competent E. coli cells (in 40 pil of appropriate media) were transformed with 3 /il of 
ligation mixture and cultured in 80 iA of SOC medium. (See, e.g., Sambrook, supra . 
Appendix A, p. 2.) After incubation for one hour at 37° C, the E. coli mixture was plated 
on Luria Bertani (LB) agar (See, e.g., Sambrook, supra . Appendix A, p. 1) containing 2x 
Carb. The following day, several colonies were randomly picked from each plate and 

20 cultured in 1 50 fA of liquid LB/2x Carb medium placed in an individual well of an 

appropriate commercially-available sterile 96-well microtiter plate. The following day, 5 
A*l of each overnight culture was transferred into a non-sterile 96-well plate and, after 
dilution 1:10 with water, 5 pel from each sample was transferred into a PCR array. 

For PCR amplification, 18 /^l of concentrated PCR reaction mix (3.3x) containing 

25 4 units of rTth DNA polymerase, a vector primer, and one or both of the gene specific 

primers used for the extension reaction were added to each well. Amplification was 

performed using the following conditions: 

Step 1 94° C for 60 sec 

Step 2 94° C for 20 sec 

30 Step 3 55° C for 30 sec 

Step 4 72° C for 90 sec 

Step 5 Repeat steps 2 through 4 for an additional 29 cycles 

Step 6 72° C for 180 sec 

Step 7 4° C (and holding) 
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Aliquots of the PCR reactions were run on agarose gels together with molecular 
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weight markers. The sizes of the PCR products were compared to the original partial 
cDNAs, and appropriate clones were selected, ligated into plasmid, and sequenced. 

The nucleotide sequences of SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13 
SEQ ID NO: 1 4, SEQ ID NO: 1 5, SEQ ID NO: 1 6, SEQ ID NO: 1 7, and SEQ ID NO: 1 8are 
5 used to obtain 5' regulatory sequences using the procedure above, oligonucleotides 
designed for 5' extension, and an appropriate genomic library. 

VI. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID 
10 N0.12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
or SEQ ID NO: 1 8 are employed to screen cDNAs, genomic DNAs, or mRNAs. Although 
the labeling of oligonucleotides, consisting of about 20 base pairs, is specifically 
described, essentially the same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
1 5 (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 ^Ci of 
[y- 32 P] adenosine triphosphate (Amersham, Chicago, IL), and T4 polynucleotide kinase 
(DuPont NEN®, Boston, MA). The labeled oligonucleotides are substantially purified 
using a Sephadex G-25 superfine resin column (Pharmacia & Upjohn, Kalamazoo, MI). 
An aliquot containing 1 0 7 counts per minute of the labeled probe is used in a typical 
20 membrane-based hybridization analysis of human genomic DNA digested with one of the 
following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba 1, or Pvu II (DuPont NEN, 
Boston, MA). 

The DNA from each digest is fractionated on a 0.7 percent agarose gel and 
transferred to nylon membranes (Nytran Plus, Schleicher & Schuell, Durham, NH). 
25 Hybridization is carried out for 1 6 hours at 40°C. To remove nonspecific signals, blots 
are sequentially washed at room temperature under increasingly stringent conditions up to 
0. 1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. After XOMAT ARtm f,i m 
(Kodak, Rochester, NY) is exposed to the blots to film for several hours, hybridization 
patterns are compared visually. 



30 



VII. Microarrays 

A chemical coupling procedure and an ink jet device can be used to synthesize 
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array elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra .^ An array 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface 
of a substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical 
array may be produced by hand or using available methods and machines and contain any 
5 appropriate number of elements. After hybridization, nonhybridized probes are removed 
and a scanner used to determine the levels and patterns of fluorescence. The degree of 
complementarity and the relative abundance of each probe which hybridizes to an element 
on the microarray may be assessed through analysis of the scanned images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may 

10 comprise the elements of the microarray. Fragments suitable for hybridization can be 
selected using software well known in the art such as LASERGENE™. Full-length 
cDNAs, ESTs, or fragments thereof corresponding to one of the nucleotide sequences of 
the present invention, or selected at random from a cDNA library relevant to the present 
invention, are arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed 

1 5 to the slide using, e.g., UV cross-linking followed by thermal and chemical treatments and 
subsequent drying. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; and Shalon, 
D. et al. (1 996) Genome Res. 6:639-645.) Fluorescent probes are prepared and used for 
hybridization to the elements on the substrate. The substrate is analyzed by procedures 
described above. 

20 

VIII. Complementary Polynucleotides 

Sequences complementary to the TRANP-encoding sequences, or any parts 
thereof, are used to detect, decrease, or inhibit expression of naturally occurring TRANP. 
Although use of oligonucleotides comprising from about 15 to 30 base pairs is described, 

25 essentially the same procedure is used with smaller or with larger sequence fragments. 
Appropriate oligonucleotides are designed using Oligo 4.06 software and the coding 
sequence of TRANP. To inhibit transcription, a complementary oligonucleotide is 
designed from the most unique 5' sequence and used to prevent promoter binding to the 
coding sequence. To inhibit translation, a complementary oligonucleotide is designed to 

30 prevent ribosomal binding to the TRANP-encoding transcript. 



IX. Expression of TRANP 



WO 99/41373 PCT/US99/02S27 



Expression of TRANP is accomplished by subcloning the cDNA into an 
appropriate vector and transforming the vector into host cells. This vector contains an 
appropriate promoter, e.g., C-galactosidase upstream of the cloning site, operably 
associated with the cDNA of interest. (See, e.g.,Sambrook, supra, pp. 404-433; and 
5 Rosenberg, M. et al. (1983) Methods Enzymol. 101:123-138.) 

Induction of an isolated, transformed bacterial strain with isopropyl beta-D- 
thiogalactopyranoside (IPTG) using standard methods produces a fusion protein which 
consists of the first 8 residues of B-galactosidase, about 5 to 15 residues of linker, and the 
full length protein. The signal residues direct the secretion of TRANP into bacterial 
1 0 growth media which can be used directly in the following assay for activity. 

X. Demonstration of TRANP Activity 

TRANP transport activity can be demonstrated through the use of a ligand mixing 
assay that is used to measure transport from early to late endosomal compartments in X. 
15 ]aeyjs oocytes. Ovaries are dissected from adult female 2L laevis anesthetized with 
3-aminobenzoic acid ethyl ester (1 g/liter) in ice water, and oocytes are isolated. 
(Mukhopadhyay A, et al. (1997) J. Cell. Biol. 136(6): 1227-1237). Oocytes are pulsed 
with 2mg/ml avidin for 5hrs at 18° C, washed, then incubated for 16hrs to allow avidin to 
transport to a late compartment. The oocytes are then incubated with lmg/ml biotin- 
20 horseradish peroxidase (HRP) for 30 mins at 1 8° C to label early endocytic compartments. 
Varying amounts of TRANP are injected into the oocytes, which are incubated at 1 8° C. 
Oocytes are collected at several time points after TRANP injection, washed, and lysed in 
lOOul of phosphate-buffered saline containing 0.3% Triton X-100, 0.2% 
methylbenzethorium chloride, and 400 ug/ml of BSA-biotin as a scavenger. Finally, the 
25 lysates are centrifuged for 30 seconds in a microfuge, and the avidin-biotin complexes are 
immunoprecipitated using anti-avidin antibody-coated plates by incubation at 4 °C 
overnight. The plates are then washed at least 5 times to remove unbound proteins. 
Transport from the early endosomes to the late compartments is quantified by measuring 
the amount of immunoprecipitated HRP; increased transport due to TRANP is quantitated 
30 by comparison with control oocytes. 

XI. Production of TRANP Specific Antibodies 
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TRANP substantially purified using PAGE electrophoresis (see, e.g., Harrington, 
M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. The TRANP amino 
acid sequence is analyzed using DNASTAR software (DNASTAR Inc) to determine 
5 regions of high immunogenicity, and a corresponding oligopeptide is synthesized and used 
to raise antibodies by means known to those of skill in the art. Methods for selection of 
appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel et al. supra , ch. 11.) 

Typically, the oligopeptides are 15 residues in length, and are synthesized using 

10 an Applied Biosystems Peptide Synthesizer Model 43 1 A using fmoc-chemistry and 
coupled to KLH (Sigma, St. Louis, MO) by reaction with N-maleimidobenzoyl-N- 
hydroxysuccinimide ester (MBS) to increase immunogenicity. (See, e.g., Ausubel et al. 
supra.) Rabbits are immunized with the oligopeptide-KLH complex incomplete Freund's 
adjuvant. Resulting antisera are tested for antipeptide activity, for example, by binding the 

15 peptide to plastic, blocking with 1% BSA, reacting with rabbit antisera, washing, arid 
reacting with radio-iodinated goat anti-rabbit IgG. 

XII. Purification of Naturally Occurring TRANP Using Specific Antibodies 
Naturally occurring or recombinant TRANP is substantially purified by 

20 immunoaffinity chromatography using antibodies specific for TRANP. An 

immunoaffinity column is constructed by covalently coupling anti-TRANP antibody to an 
activated chromatographic resin, such as CNBr-activated Sepharose {Pharmacia & 
Upjohn). After the coupling, the resin is blocked and washed according to the 
manufacturer's instructions. 

25 Media containing TRANP are passed over the immunoaffinity column, and the 

column is washed under conditions that allow the preferential absorbance of TRANP (e.g., 
high ionic strength buffers in the presence of detergent). The column is eluted under 
conditions that disrupt antibody/TRANP binding (e.g., a buffer of pH 2 to pH 3, or a high 
concentration of a chaotrope, such as urea or thiocyanate ion), and TRANP is collected. 



XIIL Identification of Molecules Which Interact with TRANP 

TRANP, or biologically active fragments thereof, are labeled with l25 I 
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Bolton-Hunter reagent. (See, e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate 
molecules previously arrayed in the wells of a multi-well plate are incubated with the 
labeled TRANP, washed, and any wells with labeled TRANP complex are assayed. Data 
obtained using different concentrations of TRANP are used to calculate values for the 
5 number, affinity, and association of TRANP with the candidate molecules. 

Various modifications and variations of the described methods and systems of the 
invention will be apparent to those skilled in the art without departing from the scope and 
spirit of the invention. Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the invention as claimed 
0 should not be unduly limited to such specific embodiments. Indeed, various modifications 
of the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the following 
claims. 
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What is claimed is: 

1 . A substantially purified polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ 

5 ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, 
and fragments thereof 

2. A substantially purified variant having at least 90% amino acid identity to 
the amino acid sequence of claim 1. 

0 

3. An isolated and purified polynucleotide encoding the polypeptide of claim 

1. 

4. An isolated and purified polynucleotide variant having at least 90% 
5 polynucleotide identity to the polynucleotide sequence of claim 3. 

5. An isolated and purified polynucleotide which hybridizes under stringent 
conditions to the polynucleotide sequence of claim 3. 

6. An isolated and purified polynucleotide which is complementary to the 
polynucleotide of claim 3. 

7. An isolated and purified polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID 
NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, 
and SEQ ID NO: 1 8, and fragments thereof. 

8. An isolated and purified polynucleotide variant having at least 90% 
polynucleotide identity to the polynucleotide sequence of claim 7. 

9. An isolated and purified polynucleotide which is complementary to the 
polynucleotide of claim 7. 
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10. An expression vector containing at least a fragment of the polynucleotide of 

claim 3. 



11. A host cell containing the expression vector of claim 10. 

5 

12. A method for producing a polypeptide comprising the amino acid sequence 
of SEQ ID NO: 1 0, SEQ ID NO: 1 1 , SEQ ID NO: 1 2, SEQ ID NO: 1 3, SEQ ID NO: 14, SEQ 
ID NO: 1 5, SEQ ID NO: 1 6, SEQ ID NO: 1 7, and SEQ ID NO: 1 8, or fragments thereof, the 
method comprising the steps of: 

10 a) culturing the host cell of claim 1 1 under conditions suitable for the 

expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

13. A pharmaceutical composition comprising the polypeptide of claim 1 in 
15 conjunction with a suitable pharmaceutical carrier. 

14. A purified antibody which specifically binds to the polypeptide of claim 1 . 

15. A purified agonist of the polypeptide of claim 1 . 

20 

1 6. A purified antagonist of the polypeptide of claim 1 . 

17. A method for treating or preventing a transport disorder, the method 
comprising administering to a subject in need of such treatment an effective amount of the 

25 pharmaceutical composition of claim 1 3. 

18. A method for treating or preventing cancer, the method comprising 
administering to a subject in need of such treatment an effective amount of the antagonist 
of claim 16. 

30 

19. A method for treating or preventing a transport disorder, the method 
comprising administering to a subject in need of such treatment an effective amount of the 
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antagonist of claim 16. 

20. A method for detecting a polynucleotide encoding a polypeptide having the 
amino acid sequence selected from the group SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID 
5 NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
and SEQ ID NO: 18, and fragments thereof, in a biological sample containing nucleic 
acids, the method comprising the steps of: 

(a) hybridizing the polynucleotide of claim 6 to at least one of the 
nucleic acids in the biological sample, thereby forming a hybridization complex; 

10 and 

(b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of a polynucleotide encoding 
the polypeptide in the biological sample. 

15 21 . The method of claim 20 wherein the nucleic acids of the biological sample 

are amplified by the polymerase chain reaction prior to hybridization. 
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SEQUENCE LISTING 



<110> INCYTE PHARMACEUTICALS. INC. 
AU- YOUNG , Jan i ce 
HILLMAN, Jennifer L. 
LAL, Preeti 
GUEGLER, Karl J. 
CORLEY, Neil C. 
YUE, Henry 
BANDMAN, Olga 
BAUGHN, Mariah R. 

<120> HUMAN TRANSPORT-ASSOCIATED MOLECULES 

<140> To Be Assigned 
<141> Herewith 

<l'50> 09/021,764 
<151> 1998-02-11 

<160> 18 

<170> PERL PROGRAM 

<210> 1 
<211> 383 
<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 144861 

<400> 1 

Met lie Leu Asn Asp Leu Leu Gly Arg Lys Leu Ser lie Met Phe 
1 5 10 15 

Ser Ala Val Pro Ser Ala Ala Gly Tyr Ala Leu Met Ala Gly Ala 
20 25 30 

His Gly Leu Trp Met Leu Leu Leu Gly Arg Thr Leu Thr Gly Phe 
35 40 45 

Ala Gly Gly Leu Thr Ala Ala Cys He Pro Val Tyr Val Ser Glu 
50 55 60 

He Ala Pro Pro Gly Val Arg Gly Ala Leu Gly Ala Thr Pro Gin 
65 70 75 

Leu Met Ala Val Phe Gly Ser Leu Ser Leu Tyr Ala Leu Gly Leu 
80 85 90 

Leu Leu Pro Trp Arg Trp Leu Ala Val Ala Gly Glu Ala Pro Val 
95 100 105 

Leu He Met He Leu Leu Leu Ser Phe Met Pro Asn Ser Pro Arg 
110 115 120 

Phe Leu Leu Ser Arg Gly Arg Asp Glu Glu Ala Leu Arg Ala Leu 
125 130 135 

Ala Trp Leu Arg Gly Thr Asp Val Asp Val His Trp Glu Phe Glu 
140 145 150 

Gin lie Gin Asp Asn Val Arg Arg Gin Ser Ser Arg Val Ser Trp 
155 160 165 

Ala Glu Ala Arg Ala Pro His Val Cys Arg Pro He Thr Val Ala 
170 175 180 

1 
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Leu 


Leu 


Met 


Arg 


Leu 


Leu Gin Gin Leu Thr Gly 


He 


Thr Pro He 










185 


190 




195 


Leu 


Val 


Tyr 


Leu 


Gin 


Ser He Phe Asp Ser Thr 


Ala 


Val Leu Leu 










200 


205 




210 


Pro 


Pro 


Lys 


Asp Asp 


Ala Ala He Val Gly Ala Val 


Arg Leu Leu 










215 


220 




225 


Ser 


Val 


Leu 


He 


Ala 


Ala Leu Thr Met Asp Leu 


Ala 


Gly Arg Lys 










230 


235 




240 


Val 


Leu 


Leu 


Phe 


Val 


Ser Gly His Leu Ser Val 


His 


Ser Gly His 










245 


250 




255 


His 


Val 


Cys 


Cys 


Gin 


Pro Asp Ser Gly Ala Val 


His 


Pro Leu Trp 










260 


265 




270 


Pro 


Gin 


Ala 


Ser 


Glu 


Pro Gin Gin His Cys Gly Pro Gly Lys Arg 










275 


280 




285 


Val 


Leu 


Gly 


Gly Leu 


Gly Ala Ala Leu Ala Ala 


Pro 


Ala <31y Tyr 










290 


295 




300 


Leu 


Thr 


Leu 


Val 


Pro 


Leu Leu Ala Thr Met Leu 


Phe 


He Met Gly 










305 


310 




315 


Arg 


Cys 


Gly 


Gly Gly 


Ser Glu Gly Gin Ala Val 


Leu Gly Val Lys 










320 


325 




330 


Gly Trp 


Val 


Met 


Cys 


Leu Gly Trp Leu Glu Arg Gly Val Ser Ser 










335 


340 




345 


Ser 


Ser 


Ala 


Glu 


Thr 


Asp Thr Ala Ala Ser Ser 


His 


Pro Thr Gly 










350 


355 




360 


Pro 


Glu 


Pro 


Ala 


Ser 


Ser Ser Glu Gin Ala Thr 


Pro 


Trp Ala Gly 










365 


370 




375 


Val 


His 


His 


Leu 


Ala 


Ala His Val 














380 









<210> 2 
<211> 272 
<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 607812 

<400> 2 

Met Ala Ser Ser Asn Thr Val Leu Met Arg Leu Val Ala Ser Ala 
15 10 15 

Tyr Ser He Ala Gin Lys Ala Gly Met He Val Arg Arg Val He 

20 25 30 

Ala Glu Gly Asp Leu Gly He Val Glu Lys Thr Cys Ala Thr Asp 

35 40 45 

Leu Gin Thr Lys Ala Asp Arg Leu Ala Gin Met Ser He Cys Ser 

50 55 60 

Ser Leu Ala Arg Lys Phe Pro Lys Leu Thr He He Gly Glu Glu 

65 70 75 

Leu Val Val Trp Val Asp Pro Leu Asp Gly Thr Lys Glu Tyr Thr 

80 85 90 

Glu Gly Leu Leu Asp Asn Val Thr Val Leu He Gly He Ala Tyr 

95 100 105 

Glu Gly Lys Ala He Ala Gly Val He Asn Gin Pro Tyr Tyr Asn 
110 115 120 

Tyr Glu Ala Gly Pro Asp Ala Val Leu Gly Arg Thr He Trp Gly 
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125 








130 




135 


Val 


Leu 


Gly Leu 


Gly Ala 


Phe Gly 


Phe 


Gin Leu Lys 


Glu Val 


Pro 










140 








145 




150 


Ala Gly Lys 


His 


He 


He 


Thr Thr 


Thr 


Arg Ser His 


Ser Asn 


Lys 










155 








160 




165 


Leu 


Val 


Thr 


Asp 


Cys 


Val 


Ala Ala 


Met 


Asn Pro Asp Ala Val 


Leu 










170 








175 




180 


Arg 


Val 


Gly Gly 


Ala Gly Asn Lys 


He 


He Gin Leu 


He Glu Gly 










185 








190 




195 


Lys 


Ala 


Ser 


Ala 


Tyr 


Val 


Phe Ala 


Ser 


Pro Gly Cys 


Lys Lys 


Trp 










200 








205 




210 


Asp 


Thr 


Cys 


Ala 


Pro 


Glu 


Val He 


Leu 


His Ala Val 


Gly Gly Lys 










215 








220 




225 


Leu 


Thr 


Asp 


He 


His 


Gly Asn Val 


Leu 


Gin Tyr His 


Lys Asp 


Val 










230 








235 




240 


Lys 


His 


Met 


Asn 


Ser 


Ala 


Gly Val 


Leu 


Ala Thr Leu 


Arg Asn 


Tyr 










245 








250 




255 


Asp 


Tyr 


Tyr 


Ala 


Ser 


Arg 


val Pro 


Glu 


Ser He Lys 


Asn Ala 


Leu 










260 








265 




270 


Val 


Pro 





















<210> 3 
<211> 210 
<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 1259384 

<400> 3 

Met Gin Arg Pro Glu Ala Trp Pro Arg Pro His Pro Gly Glu Gly 
1 5 10 15 

Ala Ala Ala Ala Gin Ala Gly Gly Pro Ala Pro Pro Ala Arg Ala 
20 25 30 

Gly Glu Pro Ser Gly Leu Arg Leu Gin Glu Pro Ser Leu Tyr Thr 
35 40 45 

He Lys Ala Val Phe He Leu Asp Asn Asp Gly Arg Arg Leu Leu 
50 55 60 

Ala Lys Tyr Tyr Asp Asp Thr Phe Pro Ser Met Lys Glu Gin Met 
65 70 75 

Val Phe Glu Lys Asn Val Phe Asn Lys Thr Ser Arg Thr Glu Ser 
80 85 90 

Glu lie Ala Phe Phe Gly Gly Met Thr He Val Tyr Lys Asn Ser 
95 100 105 

He Asp Leu Phe Leu Tyr Val Val Gly Ser Ser Tyr Glu Asn Glu 
110 115 120 

Leu Met Leu Met Ser Val Leu Thr Cys Leu Phe Glu Ser Leu Asn 
125 130 135 

His Met Leu Arg Lys Asn Val Glu Lys Arg Trp Leu Leu Glu Asn 
140 145 150 

Met Asp Gly Ala Phe Leu Val Leu Asp Glu He Val Asp Gly Gly 
155 160 165 

Val He Leu Glu Ser Asp Pro Gin Gin Val He Gin Lys Val Asn 
170 175 180 

Phe Arg Ala Asp Asp Gly <31y Leu Thr Glu Gin Ser Val Ala Gin 



3 
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185 






190 




195 


Val 


Leu 


Gin 


Ser 


Ala 


LtyS ulu olll 


lie 


Lys 


T^rs Qot" T.Ait T oil 
L ip OCl bcU JLiCU 


T .\ra 
LjyS 










200 






205 




210 


<210> 4 


















<211> 465 
















<212> PRT 
















<213> Homo sapiens 












<220> - 


















<223> 1340813 














<400> 4 


















Met 


Gly 


Glv 


Ala 


Val 


Val Asp Glu Gly Pro 


Thr Gly Val Lys 


Ala 


1 








5 






10 




15 


Pro 


Asp 


Glv 


Glv 


Trp 


Gly Trp Ala 


Val 


Leu 


Phe Gly Cys Phe 


Val 










20 






25 




30 


lie 


Thr 


Glv 


Phe 


Ser 


Tyr Ala Phe 


Pro 


Lys 


Ala Val Ser Val 


Phe 










35 






40 




45 


Phe 


Lys 


Glu 


Leu 


He 


Gin Glu Phe Gly He 


Gly Tyr Ser Asp 


Thr 










50 






55 




60 


Ala 


Trp 


lie 


Ser 


Ser 


He Leu Leu 


Ala 


Met 


Leu Tyr Gly Thr 


Gly 










65 






70 




75 


Pro 


Leu 


Cvs 


Ser 


Val 


Cys Val Asn Arg 


Phe 


Gly Cys Arg Pro 


Val 










80 






85 




90 


Met 


Leu 


Val 


Gly 


Gly 


Leu Phe Ala 


Ser 


Leu 


Gly Met Val Ala 


Ala 










95 






100 




105 


Ser 


Phe 


Cys 


Arg 


Ser 


He He Gin 


Val 


Tyr 


Leu Thr Thr Gly 


Val 










110 






115 




120 


lie 


Thr 


Gly 


Leu 


Gly 


Leu Ala Leu 


Asn 


Phe 


Gin Pro Ser Leu 


xie 










125 






130 




135 


Met 


Leu 


Asn 


Arg 


Tyr 


Phe Ser Lys 


Arg Arg 


Fro Met Ala Asn 


Gly 










140 






145 




150 


Leu 


Ala 


Ala 


Ala 


Gly 


Ser Pro Val 


Phe 


Leu 


Cys Ala Leu Ser 


Pro 










155 






160 




lob 


Leu 


Gly 


Gin 


Leu 


Leu 


Gin Asp Arg 


Tyr Gly 


lrp Arg v»±y biy 


Fne 










170 






175 




loO 


Leu 


lie 


Leu 


Gly 


Gly 


Leu Leu Leu 


Asn 


Cys 


\«ys vai uys hj^cl 


nla 










185 






190 




135 


Leu 


Met 


Arg 


Pro 


Leu 


Val Val Thr 


Ala 


Gin 


D«*a f~l "\ \f Coy (~1 1 

tr L.\j uxy ocx uiy 


trX VJ 










200 






205 




OTA. 


Pro 


Arg 


Pro 


Ser 


Arg 


Arg Leu Leu Asp 


Leu 


Oct vai jfrie Ary 


Asp 










215 






220 




o ^ c 


Arg 


Gly 


Phe 


Val 


Leu 


Tyr Ala Val 


Ala 


Ala 


<Zt*r val Mpf Va 1 
OCX vai lie l» v a x 


wcu 










230 






235 




24 0 


Gly 


Leu 


Phe 


Val 


Pro 


Pro Val Phe 


Val 


Val 


Ser Tyr Ala Lys 


Asp 










245 






250 




255 


Leu 


Gly 


Val 


Pro 


Asp 


Thr Lys Ala 


Ala 


Phe 


Leu Leu Thr He 


Leu 










260 






265 




270 


Gly 


Phe 


lie 


Asp 


He 


Phe Ala Arg 


Pro 


Ala 


Ala Gly Phe Val 


Ala 










275 






280 




285 


Gly 


Leu 


Gly 


Lys 


Val 


Arg Pro Tyr 


Ser 


Val 


Tyr Leu Phe Ser 


Phe 










290 






295 




300 


Ser 


Met 


Phe 


Phe 


Asn 


Gly Leu Ala Asp 


Leu 


Ala Gly Ser Thr 


Ala 










305 






310 




315 


Gly 


Asp 


Tyr 


Gly 


Gly 


Leu Val Val 


Phe 


Cys 


He Phe Phe Gly 


He 










320 






325 




330 



WO 99/41373 



PCT/US99/02527 



Ser 


Tyr Gly 


Met 


Val Gly Ala 


Leu 


Gin 


Phe 


Glu 


Val 


Leu Met Ala 








335 






340 






345 


He 


Val Gly Thr 


His Lys Phe 


Ser 


Ser 


Ala 


He Gly Leu Val Leu 








350 






355 






360 


Leu 


Met Glu 


Ala 


Val Ala Val 


Leu 


Val 


Gly 


Pro 


Pro 


Ser Gly Gly 








365 






370 






375 


Lys 


Leu Leu 


Asp 


Ala Thr His 


Val 


Tyr 


Met 


Tyr 


Val 


Phe He Leu 








380 






385 






390 


Ala 


Gly Ala 


Glu 


Val Leu Thr 


Ser 


Ser 


Leu 


He 


Leu 


Leu Leu Gly 








395 






400 






405 


Asn 


Phe Phe 


Cys 


He Arg Lys 


Lys 


Pro 


Lys 


Glu 


Pro 


Gin Pro Glu 








410 






415 






420 


Val 


Ala Ala 


Val 


Glu Glu Glu 


Lys 


Leu 


His 


Lys 


Pro 


Pro Ala Asp 








425 






430 






435 


Ser 


Gly Val 


Asp 


Leu Arg Glu 


Val 


Glu 


His 


Phe 


Leu 


Lys Ala Glu 








440 






445 






450 


Pro 


Glu Lys 


Asn 


Gly Glu Val Val 


His 


Thr 


Pro 


Glu 


Thr Ser Val 








455 






460 






465 



<210> 5 
<211> 237 
<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 1689731 



<400> 5 
















Met 


Leu 


Glu 


Glu 


Asp 


Met Glu Val Ala 


He 


Lys 


Met Val Val Val 


1 








5 




10 




15 


Gly 


Asn 


Gly 


Ala 


Val 


Gly Lys Ser Ser 


Met 


He Gin Arg Tyr Cys 










20 




25 




30 


Lys 


Gly 


He 


Phe 


Thr 


Lys Asp Tyr Lys 


Lys 


Thr 


He Gly Val Asp 










35 




40 




45 


Phe 


Leu 


Glu 


Arg 


Gin 


He Gin Val Asn 


Asp 


Glu 


Asp Val Arg Leu 










50 




55 




60 


Met 


Leu 


Trp 


Asp 


Thr Ala Gly Gin Glu Glu Phe Asp Ala He Thr 










65 




70 




75 


Lys 


Ala 


Tyr 


Tyr 


Arg Gly Ala Gin Ala Cys 


Val 


Leu Val Phe Ser 










80 




85 




90 


Thr 


Thr 


Asp 


Arg 


Glu 


Ser Phe Glu Ala 


Val 


Ser 


Ser Trp Arg Glu 










95 




100 




105 


Lys 


Val 


Val 


Ala 


Glu 


Val Gly Asp He 


Pro 


Thr 


Val Leu Val Gin 










110 




115 




120 


Asn 


Lys 


He 


Asp 


Leu 


Leu Asp Asp Ser 


Cys 


He 


Lys Asn Glu Glu 










125 




130 




135 


Ala 


Glu 


Ala 


Leu 


Ala 


Lys Arg Leu Lys 


Leu 


Arg 


Phe Tyr Arg Thr 










140 




145 




150 


Ser 


Val 


Lys 


Glu 


Asp 


Leu Asn Val Asn 


Glu 


Val 


Phe Lys Tyr Leu 










155 




160 




165 


Ala 


Glu 


Lys 


Tyr 


Leu 


Gin Lys Leu Lys 


Gin 


Gin 


He Ala Glu Asp 










170 




175 




180 


Pro 


Glu 


Leu 


Thr 


His 


Ser Ser Ser Asn 


Lys 


He 


Gly Val Phe Asn 










185 




190 




195 


Thr 


Ser 


Gly 


Gly 


Ser 


His Ser Gly Gin Asn 


Ser Gly Thr Leu Asn 
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200 205 210 

Gly Gly Asp Val lie Asn Leu Arg Pro Asn Lys Gin Arg Thr Lys 

215 220 225 

Lys Asn Arg Asn Pro Phe Ser Ser Cys Ser lie Pro 

230 235 



<210> 6 

<211> 208 

<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 2751730 

<400> 6 



Met 


Ser 


Ala Arg Gly 


Asp 


Phe 


Gly Asn Pro Leu Arg Lys 


Phe 


Lys 


1 








5 






10 




15 


Leu 


Val 


Phe 


Leu Gly 


Glu 


Gin 


Ser Val Gly Lys Thr Ser 


Leu 


He 










20 






25 




30 


Thr 


Arg 


Phe 


Met 


Tyr 


Asp 


Ser 


Phe Asp Asn Thr Tyr Gin 


Ala 


Thr 










35 






40 




45 


He 


Gly 


lie 


Asp 


Phe 


Leu 


Ser 


Lys Thr Met Tyr Leu Glu 


Asp 


Arg 










50 






55 




60 


Thr 


Val 


Arg 


Leu 


Gin 


Leu Trp Asp Thr Ala Gly Gin Glu 


Arg 


Phe 










65 






70 




75 


Arg 


Ser 


Leu 


He 


Pro 


Ser 


Tyr 


He Arg Asp Ser Thr Val 


Ala 


Val 










80 






85 




90 


Val 


Val 


Tyr 


Asp 


He 


Thr 


Asn 


Leu Asn Ser Phe Gin Gin 


Thr 


Ser 










95 






100 




105 


Lys 


Trp 


He 


Asp Asp 


Val 


Arg 


Thr Glu Arg Gly Ser Asp 


Val 


He 










110 






115 




120 


He 


Met 


Leu 


Val 


Gly 


Asn 


Lys 


Thr Asp Leu Ala Asp Lys 


Arg 


Gin 










125 






130 




135 


He 


Thr 


He 


Glu 


Glu 


Gly Glu 


Gin Arg Ala Lys Glu Leu 


Ser 


Val 










140 






145 




150 


Met 


Phe 


He 


Glu 


Thr 


Ser 


Ala 


Lys Thr Gly Tyr Asn Val 


Lys 


Gin 










155 






160 




165 


Leu 


Phe 


Arg 


Arg 


Val 


Ala 


Ser 


Ala Leu Pro Gly Met Glu 


Asn 


Val 










170 






175 




180 


Gin 


Glu 


Lys 


Ser 


Lys 


Glu Gly Met He Asp He Lys Leu 


Asp 


Lys 










185 






190 




195 


Pro 


Gin 


Glu 


Pro 


Pro 


Ala 


Ser Glu Gly Gly Cys Ser Cys 














200 






205 







<210> 7 

<211> 709 

<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 2794975 
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<400> 7 

Met Ala Thr Cys Ala Glu He Leu Arg Ser Glu Phe Pro Glu He 
1 5 10 15 

Asp Gly Gin Val Phe Asp Tyr Val Thr Gly Val Leu His Ser Gly 
20 25 30 

Ser Ala Asp Phe Glu Ser Val Asp Asp Leu Val Glu Ala Val Gly 
35 40 45 

Glu Leu Leu Gin Glu Val Ser Gly Asp Ser Lys Asp Asp Ala Gly 
50 55 60 

He Arg Ala Val Cys Gin Arg Met Tyr Asn Thr Leu Arg Leu Ala 
65 70 " 75 

Glu Pro Gin Ser Gin Gly Asn Ser Gin Val Leu Leu Asp Ala Pro 
80 85 90 

lie Gin Leu Ser Lys He Thr Glu Asn Tyr Asp Cys Gly Thr Lys 
95 100 105 

Leu Pro Gly Leu Leu Lys Arg Glu Gin Ser Ser Thr Val Asn Ala 
HO 115 120 

Lys Lys Leu Glu Lys Ala Glu Ala Arg Leu Lys Ala Lys Gin Glu 
125 130 . 135 

Lys Arg Ser Glu Lys Asp Thr Leu Lys Thr Ser Asn Pro Leu Val 
140 145 iso 

Leu Glu Glu Ala Ser Ala Ser Gin Ala Gly Ser Arg Lys Glu Ser 
155 160 ~ 165 

Arg Leu Glu Ser Ser Gly Lys Asn Lys Ser Tyr Asp Val Arg He 
170 175 180 

Glu Asn Phe Asp Val Ser Phe Gly Asp Arg Val Leu Leu Ala Gly 
185 190 195 

Ala Asp Val Asn Leu Ala Trp Gly Arg Arg Tyr Gly Leu Val Gly 
200 205 210 

Arg Asn Gly Leu Gly Lys Thr Thr Leu Leu Lys Met Leu Ala Thr 
215 220 225 

Arg Ser Leu Arg Val Pro Ala His He Ser Leu Leu His Val Glu 
230 235 240 

Gin Glu Val Ala Gly Asp Asp Thr Pro Ala Leu Gin Ser Val Leu 
245 250 255 

Glu Ser Asp Ser Val Arg Glu Asp Leu Leu Arg Arg Glu Arg Glu 
260 265 270 

Leu Thr Ala Gin He Ala Ala Gly Arg Ala Glu Gly Ser Glu Ala 
275 280 285 

Ala Glu Leu Ala Glu He Tyr Ala Lys Leu Glu Glu He Glu Ala 
290 295 300 

Asp Lys Ala Pro Ala Arg Ala Ser Val He Leu Ala Gly Leu Gly 
305 310 315 

Phe Thr Pro Lys Met Gin Gin Gin Pro Thr Arg Glu Phe Ser Gly 
320 325 330 

Gly Trp Arg Met Arg Leu Ala Leu Ala Arg Ala Leu Phe Ala Arg 
335 340 345 

Pro Asp Leu Leu Leu Leu Asp Glu Pro Thr Asn Met Leu Asp Val 
350 355 360 

Arg Ala He Leu Trp Leu Glu Asn Tyr Leu Gin Thr Trp Pro Ser 
365 370 375 

Thr He Leu Val Val Ser His Asp Arg Asn Phe Leu Asn Ala He 
380 385 390 

Ala Thr Asp He He His Leu His Ser Gin Arg Leu Asp Gly Tyr 
395 400 405 

Arg Gly Asp Phe Glu Thr Phe lie Lys Ser Lys Gin Glu Arg Leu 
410 415 420 

Leu Asn Gin Gin Arg Glu Tyr Glu Ala Gin Gin Gin Tyr Arg Gin 
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425 

His He Gin Val Phe He Asp Arg Phe 
440 

Ala Ser Gin Val Gin Ser Lys Leu Lys 
455 

Glu Leu Lys Pro Val Asp Lys Glu Ser 
470 

Pro Asp Gly Phe Glu Lys Phe Ser Pro 
485 

Glu Val Asp Phe Tyr Tyr Asp Pro Lys 
500 

Leu Ser Val Ser Ala Asp Leu Glu Ser 
515 

Glu Asn Gly Ala Gly Lys Ser Thr Met 
530 

Asp Leu Ala Pro Val Arg Gly He Arg 
545 

Lys He Gly Tyr Phe Ser Gin His His 
560 

Asn Val Ser Ala Val Glu Leu Leu Ala 
575 

Pro Glu Glu Glu Tyr Arg His Gin Leu 
590 

Gly Glu Leu Ala Met Arg Pro Leu Ala 
605 

Lys Ser Arg Val Ala Phe Ala Gin Met 
620 

Phe Tyr He Leu Asp Glu Pro Thr Asn 
635 

He Glu Ala Leu Gly Arg Ala Leu Asn 
650 

He Leu Val Ser His Asp Glu Arg Phe 
665 

Glu Leu Trp Val Cys Glu Gly Gly Gly 
680 

Gly Phe Asp Gin Tyr Arg Ala Leu Leu 
695 

Glu Gly Phe Leu 



<210> 8 

<211> 962 

<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 2797710 

<400> 8 

Met Asn Phe Leu Arg Gly Val Met Gly 

1 5 
Gin His Thr Glu Ala Glu Thr He Gin 
20 

Ala Ser Ser Thr Leu Leu Asp Asp Arg 
35 

Leu Lys Ser Leu Ser Lys Lys Tyr Arg 



PCT/US99/02S27 



430 








435 


Arg 


Tyr 


Asn 


Ala Asn 


Arg 


445 








450 


Met 


Leu 


Glu 


Lys Leu 


Pro 


460 








465 


Glu 


Val 


Val 


Met Lys 


Phe 


475 








460 


Pro 


He 


Leu 


Gin Leu 


Asp 


490 








495 


His 


Val 


lie 


Phe Ser 


Arg 


505 








510 


Arg 


He Cys 


Val Val 


Gly 


520 








525 


Leu 


Lys 


Lieu 


Leu Leu 


Gly 


535 








540 


His 


Ala 


His 


Arg Asn 


Leu 


550 








555 


Val 


Glu Gin Leu Asp 


Leu 


565 








570 


Arg 


Lys 


Phe 


Pro Gly 


Arg 


580 








585 


Gly 


Arg 


Tyr 


Gly He 


Ser 


595 








600 


Ser 


Leu 


Ser Gly Gly 


Gin 


610 








615 


Thr 


Met 


Pro 


Cys Pro 


Asn 


625 








630 


His 


Leu 


Asp 


Met Glu 


Thr 


640 








645 


Asn 


Phe Arg Gly Gly 


Val 


655 








660 


He 


Arg 


Leu 


Val Cys 


Arg 


670 








675 


Val 


Thr Arg Val Glu 


Gly 


685 








690 


Gin 


Glu 


Gin 


Phe Arg 


Arg 


700 








705 


Gly 


Gin Ser Ala Gly 


Pro 


10 








15 


Lys 


Leu 


Cys 


Asp Arg 


Val 


25 








30 


Arg 


Asn 


Ala 


Val Arg 


Ala 


40 








45 


Leu 


Glu 


Val 


Gly He 


Gin 
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50 55 60 

Ala Met Glu His Leu lie His Val Leu Gin Thr Asp Arg Ser Asp 
65 70 75 

Ser Glu lie lie Gly Tyr Ala Leu Asp Thr Leu Tyr Asn lie lie 
80 85 90 

Ser Asn Glu Glu Glu Glu Glu Val Glu Glu Asn Ser Thr Arg Gin 
95 100 105 

Ser Glu Asp Leu Gly Ser Gin Phe Thr Glu lie Phe lie Lys Gin 

110 115 120 

Gin Glu Asn Val Thr Leu Leu Leu Ser Leu Leu Glu Glu Phe Asp 

125 130 135 

Phe His Val Arg Trp Pro Gly Val Lys Leu Leu Thr Ser Leu Leu 

140 145 150 

Lys Gin Leu Gly Pro Gin Val Gin Gin lie lie Leu Val Ser Pro 

155 160 165 

Met Gly Val Ser Arg Leu Met Asp Leu Leu Ala Asp Ser Arg Glu 

170 175 180 

Val lie Arg Asn Asp Gly Val Leu Leu Leu Gin Ala Leu Thr Arg 

185 190 195 

Ser Asn Gly Ala lie Gin Lys lie Val Ala Phe Glu Asn Ala Phe 

200 205 210 

Glu Arg Leu Leu Asp lie lie Ser Glu Glu Gly Asn Ser Asp Gly 

215 220 225 

Gly lie Val Val Glu Asp Cys Leu He Leu Leu Glri Asn Leu Leu 

230 235 240 

Lys Asn Asn Asn Ser Asn Arg Asn Phe Phe Lys Glu Gly Ser Tyr 

245 250 255 

He Gin Arg Met Lys Pro Trp Phe Glu Val Gly Asp Glu Asn Ser 

260 265 270 

Gly Trp Ser Ala Gin Lys Val Thr Asn Leu His Leu Met Leu Gin 

275 280 285 

Leu Val Arg Val Leu Val Ser Pro Thr Asn Pro Pro Gly Ala Thr 

290 295 300 

Ser Ser Cys Gin Lys Ala Met Phe Gin Cys Gly Leu Leu Gin Gin 

305 310 315 

Leu Cys Thr He Leu Met Ala Thr Gly Val Pro Ala Asp He Leu 

320 325 330 

Thr Glu Thr He Asn Thr Val Ser Glu Val He Arg Gly Cys Gin 

335 340 345 

Val Asn Gin Asp Tyr Phe Ala Ser Val Asn Ala Pro Ser Asn Pro 

350 355 360 

Pro Arg Pro Ala He Val Val Leu Leu Met Ser Met Val Asn Glu 

365 370 375 

Arg Gin Pro Phe Val Leu Arg Cys Ala Val Leu Tyr Cys Phe Gin 

380 385 390 

Cys Phe Leu Tyr Lys Asn Gin Lys Gly Gin Gly Glu He Val Ser 

395 400 405 

Thr Leu Leu Pro Ser Thr He Asp Ala Thr Gly Asn Ser Val Ser 

410 415 420 

Ala Gly Gin Leu Leu Cys Gly Gly Leu Phe Ser Thr Asp Ser Leu 

425 430 435 

Ser Asn Trp Cys Ala Ala Val Ala Leu Ala His Ala Leu Gin Glu 

440 445 450 

Asn Ala Thr Gin Lys Glu Gin Leu Leu Arg Val Gin Leu Ala Thr 

455 460 465 

Ser He Gly Asn Pro Pro Val Ser Leu Leu Gin Gin Cys Thr Asn 

470 475 480 

He Leu Ser Gin Gly Ser Lys He Gin Thr Arg Val Gly Leu Leu 
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485 490 495 



Met 


Leu 


Leu 


Cys 


Thr 


Trp 


Leu Ser 


Asn Cys 


Pro 


He 


Ala 


Val Thr 










500 








505 








510 


His 


Pne 


Leu 


His 


Asn 


Ser 


Ala Asn 


Val 


Pro 


Phe 


Leu 


Thr 


Gly Gin 










515 








520 








525 


lie 


Ala 


Glu 


Asn 


Leu Gly Glu Glu Glu Gin 


Leu Val Gin Gly Leu 










530 








535 








54 0 


Cys 


Ala 


Leu 


Leu 


Leu 


Gly 


He Ser 


He 


Tyr 


Phe 


Asn Asp Asn Ser 










545 








550 








555 


Leu 


Glu 


Ser 


Tyr 


Met 


Lys 


Glu Lys 


Leu 


Lys 


Gin 


Leu 


He Glu Lys 










560 








565 








570 


Arg 


He 


Gly Lys 


Glu 


Asn 


Phe He 


Glu 


Lys 


Leu Gly 


Phe 


He Ser 










575 








580 








585 


Lys 


His 


Glu 


Leu 


Tyr Ser Arg Ala 


Ser 


Gin 


Lys 


Pro 


Gin 


Pro Asn 










590 








595 








600 


Phe 


Pro 


Ser 


Pro 


Glu 


Tyr 


Met He 


Phe 


Asp 


His 


Glu 


Phe 


Thr Lys 










605 








610 








615 


Leu 


Val 


Lys 


Glu 


Leu Glu Gly Val 


He 


Thr 


Lys 


Ala 


He 


Tyr Lys 










620 








625 








630 


Ser 


Ser 


Glu 


Glu 


Asp 


Lys 


Lys Glu 


Glu 


Glu 


Val 


Lys 


Lys 


Thr Leu 










635 








640 








645 


Glu 


Gin 


His 


Asp 


Asn 


He 


Val Thr 


His 


Tyr 


Lys 


Asn 


Met 


He Arg 










650 








655 








660 


Glu Gin Asp Leu 


Gin 


Leu 


Glu Glu 


Leu Arg 


Gin 


Gin 


Val 


Ser Thr 










665 








670 








675 


Leu 


Lys 


Cys 


Gin 


Asn 


Glu 


Gin Leu 


Gin 


Thr 


Ala 


Val 


Thr 


Gin Gin 










680 








665 








690 


Val 


Ser 


Gin 


He 


Gin 


Gin 


His Lys 


Asp 


Gin 


Tyr 


Asn 


Leu 


Leu Lys 










695 








700 








705 


He 


Gin Leu Gly 


Lys 


Asp 


Asn Gin 


His 


Gin 


Gly 


Ser 


Tyr 


Ser Glu 










710 








715 








720 


Gly Ala Gin Met 


Asn Gly 


He Gin 


Pro 


Glu 


Glu 


He Gly Arg lieu 










725 








730 








735 


Arg 


Glu 


Glu 


He 


Glu 


Glu 


Leu Lys 


Arg 


Asn 


Gin 


Glu 


Leu 


Leu Gin 










740 








745 








750 


Ser 


Gin 


Leu 


Thr 


Glu 


Lys 


Asp Ser 


Met 


He 


Glu 


Asn 


Met 


Lys Ser 










755 








760 








765 


Ser 


Gin 


Thr 


Ser 


Gly Thr Asn Glu Gin Ser 


Ser 


Ala 


He 


Val Ser 










770 








775 








780 


Ala 


Arg 


Asp 


Ser 


Glu 


Gin 


Val Ala 


Glu 


Leu 


Lys 


Gin 


Glu 


Leu Ala 










785 








790 








795 


Thr 


Leu Lys 


Ser 


Gin 


Leu 


Asn Ser 


Gin 


Ser 


Val 


Glu 


He 


Thr Lys 










800 








805 








810 


Leu 


Gin 


Thr 


Glu 


Lys 


Gin 


Glu Leu 


Leu 


Gin 


Lys 


Thr 


Glu 


Ala Phe 










815 








820 








825 


Ala 


Lys 


Ser 


Val 


Glu 


Val 


Gin Gly Glu 


Thr <3lu 


Thr 


He 


lie Ala 










830 








835 








840 


Thr 


Lys 


Thr 


Thr 


Asp 


Val 


Glu Gly Arg 


Leu 


Ser 


Ala 


Leu 


Leu Gin 










845 








850 








855 


Glu 


Thr 


Lys 


Glu 


Leu 


Lys 


Asn Glu 


He 


Lys 


Ala 


Leu 


Ser 


Glu Glu 










860 








865 








870 


Arg 


Thr 


Ala 


He 


Lys 


Glu 


Gin Leu Asp 


Ser 


Ser 


Asn 


Ser 


Thr He 










875 








880 








885 


Ala 


He 


Leu 


Gin 


Thr 


Glu 


Lys Asp 


Lys 


Leu 


Glu 


Leu 


Glu 


He Thr 










890 








895 








900 


Asp 


Ser 


Lys 


Lys 


Glu 


Gin 


Asp Asp 


Leu 


Leu 


Val 


Xjeu 


Leu Ala Asp 










905 








910 








915 


Gin Asp Gin Lys 


He 


Leu 


Ser Leu 


Lys 


Asn 


Lys 


Leu 


Lys 


Asp Leu 
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920 925 930 

Gly His Pro Val Glu Glu Glu Asp Glu Leu Glu Ser Gly Asp Gin 

935 940 945 

Glu Asp Glu Asp Asp Glu Ser Glu Asp Pro Gly Lys Asp Leu Asp 

950 955 960 

His lie 



<210> 9 

<211> 368 

<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 2914719 

<400> 9 

Met Ser Leu Phe Gly Thr Thr Ser Gly Phe Gly Thr Ser Gly Thr 
15 10 is 

Ser Met Phe Gly Ser Ala Thr Thr Asp Asn His Asn Pro Met Lys 
20 25 30 

Asp He Glu Val Thr Ser Ser Pro Asp Asp Ser He Gly Cys Leu 
35 40 45 

Ser Phe Ser Pro Pro Thr Leu Pro Gly Asn Phe Leu He Ala Gly 
50 55 60 

Ser Trp Ala Asn Asp Val Arg Cys Trp Glu Val Gin Asp Ser Gly 
65 70 75 

Gin Thr He Pro Lys Ala Gin Gin Met His Thr Gly Pro Val Leu 
80 85 90 

Asp Val Cys Trp Ser Asp Asp Gly Ser Lys Val Phe Thr Ala Ser 
95 100 105 

Cys Asp Lys Thr Ala Lys Met Trp Asp Leu Ser Ser Asn Gin Ala 
HO us 120 

He Gin He Ala Gin His Asp Ala Pro Val Lys Thr lie His Trp 
125 130 135 

He Lys Ala Pro Asn Tyr Ser Cys Val Met Thr Gly Ser Trp Asp 
140 145 150 

Lys Thr Leu Lys Phe Trp Asp Thr Arg Ser Ser Asn Pro Met Met 
155 160 165 

Val Leu Gin Leu Pro Glu Arg Cys Tyr Cys Ala Asp Val He Tyr 
170 175 180 

Pro Met Ala Val Val Ala Thr Ala Glu Arg Gly Leu He Val Tyr 
185 190 195 

Gin Leu Glu Asn Gin Pro Ser Glu Phe Arg Arg He Glu Ser Pro 
200 205 210 

Leu Lys His Gin His Arg Cys Val Ala He Phe Lys Asp Lys Gin 
215 220 225 

Asn Lys Pro Thr Gly Phe Ala Leu Gly Ser He Glu Gly Arg Val 
230 235 240 

Ala He His Tyr He Asn Pro Pro Asn Pro Ala Lys Asp Asn Phe 
245 250 255 

Thr Phe Lys Cys His Arg Ser Asn Gly Thr Asn Thr Ser- Ala Pro 
260 265 270 

Gin Asp He Tyr Ala Val Asn Gly He Ala Phe His Pro Val His 
275 280 285 

Gly Thr Leu Ala Thr Val Gly Ser Asp Gly Arg Phe Ser Phe Trp 
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290 295 300 

Asp Lys Asp Ala Arg Thr Lys Leu Lys Thr Ser Glu Gin Leu Asp 

305 310 315 

Gin Pro lie Ser Ala Cys Cys Phe Asn His Asn Gly Asn lie Phe 

320 325 330 

Ala Tyr Ala Ser Ser Tyr Asp Trp Ser Lys Gly His Glu Phe Tyr 

335 340 345 

Asn Pro Gin Lys Lys Asn Tyr lie Phe Leu Arg Asn Ala Ala Glu 

350 355 360 

Glu Leu Lys Pro Arg Asn Lys Lys 

365 



<210> 10 

<211> 1527 

<212> DNA 

<213> Homo sapiens 



<220> - 
<223> 144861 



<400> 10 

ctcgccccga 

acgacacctt 

tgcagaacaa 

ggtatgccct 

acctgcatct 

cggccggagg 

gttctcagct 

gatgctgctg 

cccggtgtac 

ccagctcatg 

gcgctggctg 

catgcccaac 

gctggcctgg 

caacgtccgg 

ccggcccatc 

catcctggtc 

cgacgcagcc 

ggacctcgca 

ccatcatgtt 

tgagccccaa 

tgcagcaccc 

gggtaggtgt 

gtgtctgggg 

ctccagtcac 

tggggtccat 

caaggcctac 



gagagacccg 
ccccgagaag 
aagggtgttc 
ggtctacaca 
gaccaaatcc 
ctgagtgcca 
gtgccgtcgg 
ctcggaagga 
gtgtctgaga 
gcagtgttcg 
gctgtggccg 
tcgccgcgct 
ctgcgtggga 
agacagagca 
accgtggcct 
tacctgcagt 
atcgttgggg 
ggccgcaagg 
tgctgccaac 
cagcactgcg 
gctggctatc 
ggtggtggct 
tggctggaga 
cccacagggc 
cacctggctg 
gccgtgggct 



gccatgcagg 
ccgcccccgt 
ctggccacct 
tcccctgtca 
caggcatcct 
tgatcctcaa 
cggccggcta 
cgctgacggg 
ttgctccccc 
gatccctgtc 
gggaggcgcc 
tcctgctctc 
cggacgtcga 
gccgagtatc 
tgctgatgcg 
ccatcttcga 
ccgtgcggct 
tgctgctctt 
ctgactctgg 
ggcctggaaa 
tcaccctggt 
cagaggggca 

ggggggtctc 

ctgaacctgc 
ctcatgtctg 

ggggtcc 



agccgctgct 
cgccagggga 
tcgccgcagt 
tcccagccct 
ggtttgggtc 
cgacctcctg 
tgcgctcatg 
cttcgccggg 
aggcgttcgt 
cctctacgcc 
tgtgctcatc 
tcggggcagg 
tgtccactgg 
gtgggctgag 
cctcctgcag 
cagcaccgct 
cctgtccgtg 
cgtctcaggc 
ggctgtacat 
gcgagtcctg 
gcccctgctg 
ggctgtcttg 
cagcagctca 
ctcctcctcc 
aggtcctgcc 



gggagccgag 
cagggcgcgg 
gctcggcaat 
ggagcgctcc 
cgtgttcacc 
ggccggaagc 
gcgggtgcgc 
ggtctcacag 
ggggctctgg 
cttggcctcc 
atgatcctgc 
gacgaagagg 
gagttcgagc 
gcacgggccc 
cagctgacgg 
gtcctgctgc 
ctgatcgccg 
cacctgtctg 
ccactttggc 
gggggacttg 
gccaccatgc 
ggtgttaagg 
gcggagacag 
gagcaggcta 
ctgcgtgccg 



ggcccggact 
gtcgggaccc 
ttoagctttg 
ttggatcctg 
ctgggagcag 
tgagcatcat 
acggcctctg 
ctgcctgcat 
gggccacacc 
tgctgccgtg 
tgctcagctt 
ccctgcgggc 
agatccagga 
cccacgtgtg 
gcatcacgcc 
cccccaagga 
ccctcaccat 
tccacagcgg 
cocaggcctc 
gcgcagccct 
tcttcatcat 
gatgggtgat 
acacagocgc 
cgocgtgggc 
tggcgtggct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1S27 



<210> 11 

<211> 1270 

<212> DNA 

<213> Homo sapiens 



<220> 
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<223> 607812 



<400> 11 

ggatgccatc cttctcaaaa gacttattga 
agggacctgg gtctacgata acgcgctttt 
tgttccagag tgtaccatgg cttccagtaa 
atattctatt gctcaaaagg caggaatgat 
gggtattgtg gagaagacct gtgcaacaga 
gatgagcata tgttcttcat tggcccggaa 
gctcgtggtc tgggttgatc ctctggatgg 
caatgtaaca gttcttattg gaattgctta 
ccagccatat tacaactatg aggcaggacc 
agttttaggt ttaggcgcct ttgggtttca 
tatcacaact actcgatccc atagcaacaa 
ccccgatgct gtgctgcgag taggaggagc 
caaagcctct gcttatgtat ttgcaagtcc 
agaagttatt ttacatgctg tgggaggcaa 
gtaccacaag gatgtgaagc atatgaactc 
tgactactat gcaagccgag ttccagaatc 
agtttcattt ggccgggcgc ggtggctcat 
ggcaggtgga tcacttgagc tcgggagttt 
catctctaca aaaatacaaa ttaactgggc 
ttgagaggct gaagcagaag aatctcttgg 
gatcgtgcca ctgcactcca gcctgagtga 
tataaaaggg 



cagtgccaaa gctcggtact ggacacaacg 60 
gctcctcctg aagtgtcttt ggtccaacgt 120 
cactgtgttg atgcggttgg tagcctccgc 180 
agtcagacgt gttattgctg aaggagacct 240 
cctgcagacc aaagctgacc gattggcaca 300 
attccccaaa ctcacaatta taggggaaga 360 
aaccaaggaa tataccgaag gtcttcttga 420 
tgaaggaaaa gccatagcag gagttattaa 480 
agatgctgtg ttggggagga caatctgggg 540 
gctgaaagaa gtccctgctg ggaaacacat 600 
gttggttact gactgtgttg ctgctatgaa 660 
aggaaataag attattcagc tgattgaagg 720 
tggttgtaag aagtgggata cttgtgctcc 780 
gttaaccgat atccatggga atgttcttca 840 
tgcaggagtc ctggccacac tgaggaatta 900 
tattaaaaat gcacttgttc cttaaaggaa 960 
gcctgtagtc ccggcacttt gggaggccga 1020 
gagaccagcc tgggcaatat cgtgagaccc 1080 
atcctgtcat gcgcctgtcg tcccagctac 1140 
gcccgggagg cggaggttgc agtgagctga 1200 
caggagttaa gccctgtctc agaaatagaa 1260 

1270 



<210> 12 
<211> 902 
<212> DMA 

<213> Homo sapiens 

<220> - 
<223> 1259384 



<400> 12 

gcggaatgca gcggcccgag gcctggccac 
cccaggccgg gggcccggcg ccgcctgctc 
aggaaccttc cctctacacc atcaaggctg 
tgctggccaa gtattatgat gacacattcc 
aaaatgtctt caacaagacc agccggactg 
ccatcgtcta caagaacagc attgacctct 
atgagctgat gctcatgtct gttctcacct 
ggaagaacgt ggagaagcgc tggttgctgg 
acgagattgt ggatggcggt gtgattctgg 
tgaattttag ggcagatgat ggcggcttga 
ctgccaagga acaaattaaa tggtcgttat 
tgccccccag atcatttccc caatcctggc 
acccctctgt atccccaggt ccctcccaga 
ctgagatgca aaggtttggc ctcaggagag 
catatcttag gcattcctgg ccccagggcc 
aa 



gtccgcaccc gggggagggg gccgcggcgg 60 
gagccgggga gccctcgggg ctgcggttgc 120 
ttttcatcct agataatgac ggrgcgccggc 180 
cctccatgaa ggagcagatg gttttcgaga 240 
agagtgagat tgcatttttt gggggtatga 300 
tcctatacgt ggtgggctca tcctacgaga 360 
gcctgtttga gtctctgaac cacatgttaa 420 
agaacatgga cggagccttc ttggtgctgg 480 
agagtgaccc ccagcaagtg atccagaagg 540 
ctgaacagag tgtggcccag gttcttcagt 600 
tgaaatgaag gctgtggatt caaggctccc 660 
aaaagcccaa agatcccagg gtcaggagag 720 
actgactcct aaggtctcca gccagggctt 780 
tcaccttttc tcacggccct ggccttaact 84 0 
ctaataaacc tgcttttgtc ttctgcaaat 900 

902 



<210> 13 
<211> 2026 
<212> DNA 

<213> Homo sapiens 
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<220> - 
<223> 1340813 

<400> 13 

gcagaggcgg cgagaggcgg gctgaggcgg cccagcggcg gcaggtgagg cggaa-ccaac 60 
cctcctggcc atgggagggg ccgtggtgga cgagggcccc acaggcgtca aggcccctga 120 
cggcggctgg ggctgggccg tgctcttcgg ctgtttcgtc atcaccggct tctcctacgc 180 
cttccccaag gccgtcagtg tcttcttcaa ggagctcata caggagtttg ggatcggcta 240 
cagcgacaca gcccggatct cctccatcct gctggccatg ctctacggga caggtocgct 300 
ctgcagtgtg tgcgtgaacc gctttggctg ccggcccgtc atgottgtgg ggggtctctt 360 
tgcgtcgctg ggcatggtgg ctgcgtcctt ttgccggagc atcatccagg totacctcac 420 
cactggggtc atcacggggt tgggtttggc actcaacttc cagccctcgo tcatcatgct 480 
gaaccgctac ttcagcaagc ggcgccccat ggccaacggg ctggcggcag caggtagccc 540 
tgtcttcctg tgtgccctga gcccgctggg gcagctgctg caggaccgct acggctggcg 600 
gggcggcttc ctcatcctgg gcggcctgct gctcaactgc tgcgtgtgtg ccgcactcat 660 
gaggcccctg gtggtcacgg cccagccggg ctcggggccg ccgcgaccct cccggcgcct 720 
gctagacctg agcgtcttcc gggaccgcgg ctttgtgctt tacgccgtgg ccgocfccggt 780 
catggtgctg gggctcttcg tcccgcccgt gttcgtggtg agctacgcca aggacctggg 840 
cgtgcccgac accaaggccg ccttcctgct caccatcctg ggcttcafctg aoatcttogc 900 
gcggccggcc gcgggcttcg tggcggggct tgggaaggtg cggccctact ccgtctacct 960 
cttcagcttc tccatgttct tcaacggcct cgcggacctg gcgggctcta cggcgggcga 102 0 
ctacggcggc ctcgtggtct tctgcatctt ctttggcatc tcctacggca tggtgggggc 1080 
cctgcagttc gaggtgctca tggccatcgt gggcacccac aagttctcca gtgccattgg 114 0 
cctggtgctg ctgatggagg cggtggccgt gctcgtcggg cccccttcgg gaggcaaact 1200 
cctggatgcg acccacgtct acatgtacgt gttcatcctg gcgggggccg aggtgctcac 1260 
ctcctccctg attttgctgc tgggcaactt cttctgcatt aggaagaagc ccaaagagcc 1320 
acagcctgag gtggcggccg tggaggagga gaagctccac aagcctcctg <:agactcggg 1380 
ggtggacttg cgggaggtgg agcatttcct gaaggctgag cctgagaaaa acggggaggt 1440 
ggttcacacc ccggaaacaa gtgtctgagt ggctgggcgg ggccggcagg cacagggagg 1500 
aggtacagaa gccggcaacg cttgctattt attttacaaa ctggactggc tcaggcaggg 1560 
ccacggctgg gctccagctg ccggcccagc ggatcgtcgc ccgatcagtg ttttgagggg 1620 
• gaaggtggcg gggtgggaac cgtgtcattc cagagtggat ctgcggtgaa gccaagccgc 1680 
aaggttacaa ggcatcctca ccaggggccc cgcctgctgc tcccaggtgg cctgcggcca 1740 
ctgctatgct caaggacctg gaaacccatg cttcgagaca acgtgacttt aatgggaggg 1800 
tgggtgggcc gcagacaggc tggcagggca ggtgctgcgt ggggccctct ccagcccgtc 1860 
ctaccctggg ctcacatggg gcctgtgccc acccctcttg agtgtcttgg ggacagctct 1920 
ttccacccct ggaagatgga aataaacctg cgtgtgggtg gagtgttagg aaaaaaaaaa 1980 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aagtct 2026 



<210> 14 

<211> 2829 

<212> DNA 

<213> Homo sapiens 

<220> - 
<223> 1689731 

<400> 14 

ccgatcccaa gttccagcta gagaaggtgg gcggcagcgc cagccxrggcg ccccctccca 60 

gctcccagcg atttgagagg gaagatctga tctcctagaa ttagagttgg tacagaaacc 120 

atttcagctc caaaaatgtt ggaggaagat atggaagtcg ccataaagat ggtggttgta 180 

gggaatggag cagttggaaa atcaagtatg attcagcgat attgcaaagg catttttaca 240 

aaagactaca agaaaaccat tggagttgat tttttggagc gacaaattca agttaatgat 300 

gaagatgtca gactaatgtt atgggacact gcaggtcagg aggaatttga tgcaattaca 360 

aaggcctact atcgaggagc ccaggcttgt gtgctcgtgt tctctaccac agatagggaa 420 

tcttttgaag cagtttccag ttggagagag aaagtagtag ccgaagtggg agatatacca 480 

actgtacttg tgcaaaacaa gattgatctt ctggatgatt cttgtataaa gaatgaggaa 540 
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gctgaggcac tggcaaaaag gttaaagtta agattctaca gaacatcagt gaaagaagat 600 
ctaaatgtga atgaagtttt taagtatttg gctgaaaaat accttcagaa actcaaacaa 660 
caaatagctg aggatccaga actaacgcat tcaagtagta acaagattgg tgtctttaat 720 
acatctggtg gaagtcactc cggtcagaat tcaggtaccc tcaatggtgg agatgtcatc 780 
aatcttagac ccaacaaaca aaggaccaag aaaaacagaa atccttttag cagctgtagc 840 
ataccctaag atgttttggg aggaaaacaa ttgaattaca ttgtgcaatg cattaagaaa 900 
cccatccagc tgtcatggta tgttacaagg ttttcagcag actttgctcc taatggctct 960 
ctgaaagtga cagacttaaa tattgctctg cagtgctttt cagaatgtag agtgagacct 1020 
ctggtggaaa aattattgcc ctgtggactc ctttaatttt ttttacatta gacgaagctt 1080 
ctcctgtttg taaaggaatg ctataagaaa tttcaaacac aggttttgcg gaattttaaa 1140 
tgctggaaaa cataaatgaa atgcctaaat atattgttac agattttaat actgtggatc 1200 
aaggaaagat agcaagcatt cttttcctga aattggtcat gtagcttttc tgcgaaatag 1260 
tattgaatct gaatactttc acaaaataat actgaaaggt aaaccctaaa tcttgccaga 1320 
ctgcattagt attttggcat tgaaacttgt gcaatatctg tgtaatgtag aatgtggggt 1380 
gtgcatgcag atgtctggta ctggagtaag ctgaaggtat gtgcagtatt taagaaaatg 1440 
tacattctaa ttaattcact tataaaacca tttacatatt gtgtatatgt ttttattttc 1500 
aaatgataaa ctctagacat atattttgca aaatgagctt gggtttaaaa agtgctttgt 1560 
gccaaaaata tcatacctaa tttttatagt gctttactat cagaatttat attgtttatt 1620 
atatctttta aaagcagtaa gcagttttca tctgacagcg tagataaaag aggagtatag 1680 
gaatttgaat gctagagcat ctcccaggtg agccttattg cagatctttt tgatatagtc 1740 
acccataagc atggtgagct cctgctctca taaggtcagc caaagcccct taagaaggta 1800 
cacagcacat ctgcagtcat tcaggaggtc ctgtgaagtg tggtgatgca tctaacacct 1860 
tttgctgctc actgagaacc cagctttgaa cttacaagat tacactaacc atttttaaag 1920 
tgcacaaaaa tattattatt agtcttaact aaaattacat attttataat gccctgcaac 1980 
tctaaaccat ccttttgttt acacattttt gtatggtatt ttaagattat caatcacttc 2040 
aacatttcta taaccttgtt tgtatagcac tttctcaaga gctgtcagtc ccacatacag 2100 
aatcatttct tttacaaaat gatttaatat gtaatcatgt gtcaggacat aaattttgtc 2160 
acttgggaca gtagctttcc agctaagata attgcgtgaa ctctgttaaa actgattatc 2220 
actcatcaat ggctttattt ttaacctata aataacccct ttcaatatac ctggttttta 2280 
cagcattgca aaatctgtaa aaataacttt tattggaaaa tgtcatgaag tcatttttca 2340 
ttatttgtac taaacaatgt tttatgtact ccttttgtgt tttatctttt gtttttccat 2400 
ctgtactccc ctcatcctcc taatgttgag aaaatcaaat ttcctgggtg tgtccttgtt 2460 
aagtgatcaa aagtaaatgt tctctttgcc attttagatg gaaaaataaa aaaaattaaa 2520 
aagtaaattt ttgcttttca ttatattgaa atgatctggc attatctatt tcacagtaat 2580 
gtaaaatgct tatttttatg tcaccagttg tcaccactcc cattgctgat agtgacatag 2640 
gttttggggg caaaattcag ttcctaacaa acagtggata ctgctaatga gtcatctttt 2700 
ctactctctt caattttcta ctctctactc tcctctttaa tttttgcaca aaatcctact 2760 
ttgcaacaaa tagtcttttg tccttattgg gaggaaaatc acttgctaaa tgtgctctag 2820 
ttagttctt * 



<210> 15 

<211> 1589 

<212> DNA 

<213> Homo sapiens 

<220> - 
<223> 2751730 

<400> 15 

cccggctgcc agcaccatgt ccgccagggg 
gttggtgttc ttgggggagc agagcgtcgg 
cgacagcttc gacaacacat accaggcaac 
gtacttggag gaccgcacgg tgcgactgca 
ccgcagcctg atccccagct acatccggga 
cacaaatctc aactccttcc aacagacctc 
gggcagtgat gttatcatca tgctggtggg 
gataaccatc gaggaggggg agcagcgcgc 



agattttggg aatccactga gaaaattcaa 60 
gaagacgtct ctgattacga ggttcatgta 120 
cattgggatt gacttcttgt caaaaaccat 180 
gctctgggac acagctggtc aggagaggtt 240 
ctccacggtg gctgtggtgg tgtacgacat 300 
taagtggatc gacgacgtca ggacagagag 360 
caacaagacg gacctggctg ataagaggca 420 
caaagaactg agcgtcatgt tcattgagac 4 60 
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cagtgcgaag actggctaca acgtgaagca 
cggaatggag aatgtccagg agaaaagcaa 
accccaggag cccccggcca gcgagggcgg 
gcttcccatg acactccttg cttgttgtgt 
gagggaaccg agttatcaag atgggaggat 
gtgggatggg gagggaggct gggcatcagg 
aacaactatt ttttggtttg gttgtaatat 
ttaaaattta aaaaaaattt aaatcatgtg 
tcatttttat ttgaaagatg tgctttttgt 
ttttccacac acctcctcct tcctgttctc 
ctatccccag cccaataaat taaaacaatt 
gggccatctg gccccactct ctaggcccta 
atcagcctct gaaaaggtag ctctgactct 
cctcgtgtgg agctgagacc aaggtctgtg 
aggaactgct cccacctctc cttctctttc 
caggcaacct tgtgttctct gtgccctcag 
acgaggtgct gttggcaatt gcagctgcgt 
tgtacttctg aaaatatttt aaccagccca 
gtctaagaaa ctcagtgtgt gatgttggg 



gctttttcga cgtgtggcgt cggctctacc 540 
agaagggatg attgacatca ag-ctggacaa 600 
ctgctcctgc taatgcagag ccgacctgtg 660 
tgcttcctat tggctagctt cctaaggggg 720 
ttttcttttc tctctgtctt taggagtagg 780 
gatcacatca ctcttaacgg ctgttactta 840 
attgtacttt attaagattg ccaaaaactg 900 
tatacaattt tttgccagat aaaaatgtag 960 
ttttgtatat ttgtaaactt atagagaacc 1020 
tttgaaccat tcatcacctc tgccttcctc 1080 
aactgagcaa attaattagg cttcagtctg 1140 
ctccagttaa atcaaacatt ggttgaacac 1200 
tgtctttcca tggccagagt gggtcgfecaa 1260 
ggctctaggc agcccggcag cagcccccag 1320 
cctgaaaagc cttgaagggg gttgagggta 1380 
ggactggcag agcaagaggc catcagaagg 1440 
ggtgcgtggt ttatagactg gggtgccatg 1500 
cccagcagtc agfcacacatg cctgggccaa 1560 

1589 



<210> 16 

<211> 2533 

<212> DNA 

<213> Homo sapiens 

<220> - 
<223> 2794975 

<400> 16 

cacagcgact gcgcggacgg gttcctgagt ggaacatggc gacttgcgcc gaaatcctgc 60 
ggagcgagtt ccccgaaatt gacggacaag tcttcgacta cgfcgaccggc gtcttgcaca 120 
gcggcagcgc ggacttcgag tctgtggatg acctggtgga agctgtaggg gaactattgc 180 
aagaggtgtc cggggacagc aaggatgacg cgggcatcag ggccgtgtgc cagcgcatgt 240 
acaacactct gcgtctggct gagccacaaa gccagggaaa tagccaggtg cta<:tggacg 300 
cccctatcca gttgtcaaag ataacggaga actacgactg tggaaccaaa cttccaggac 360 
tgctaaagag ggaacagtcc tcgacagtga atgcaaagaa gttagagaag gccgaggctc 420 
gacttaaggc aaagcaggag aagcgctcag agaaggacac gctcaagacc agcaacoctc 480 
tagtcttaga agaggcatca gccagccagg caggcagcag aaaggagagt cggttggaat 540 
catctggcaa gaacaaatcc tatgatgtgc gaattgagaa ctttgatgtg tcttttggcg 6O0 
atagagtact gctggctgga gcggatgtga acctggcatg gggccgocgt tacgggctgg £60 
tggggcggaa tgggttgggg aagacaacgt tactgaagat gctggccacc cggagtctgc 720 
gggttccagc ccacatttcc ctgctgcacg ttgagcaaga ggttgctgga gatga-cactc 780 
ctgccctgca gagtgtgctg gagagtgaca gtgtgcgaga ggatttgcta oggagggagc 840 
gggagctcac tgcccagatt gctgctggca gggcggaggg ctctgaagct gcagagctgg 900 
cagaaatcta tgccaaactg gaggagattg aggctgacaa ggcacctgcc agggcatcag 960 
tcattctcgc tgggcttggc tttaccccta aaatgcagca gcagcocacc cgggagttct 1020 
caggtggctg gaggatgagg ctggccctgg cccgggccct ctttgctagg ccagatcttc 1080 
tgctgttaga tgaacctaca aacatgctgg atgtcagggc catcctgtgg ctggagaatt 1140 
acctgcagac gtggccctcc accatcctag tcgtctccca cgaccgcaac ttcttgaatg 1200 
ccatcgccac agacatcatc cacctgcaca gccagcggct agatggttac cggggagact 1260 
ttgagacctt catcaagagt aagcaggagc ggctgctcaa ccagcagcgt gaatatgagg 1320 
cgcagcagca gtatcgccag cacatccagg ttttcattga ccggtttcgc tacaatgcca 1380 
acagggcctc tcaagtgcag agtaaactca agatgctgga gaagctgcct gagctgaagc 1440 
ctgtggacaa ggaatcagag gtcgtaatga agttccctga tgggtttgag aagttotcgc 1500 
cgccaattct gcagctagat gaggtggatt tctacta-cga tecgaagcac gtcatottca 1560 
gtcgcctctc tgtgtctgct gatct-cgagt ctcgcatctg tgtggttgga gagaatgggg 1620 
ctgggaagtc taccatgctg aagctgcttt tgggggacct ggcacctgtt cggggcatca 1680 
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gacacgctca caggaatctg aagattggct 
acctaaacgt cagtgctgtg gaactgctgg 
agtaccgtca ccagctgggt cggtatggca 
ccagcctgtc tgggggccag aagagccgag 
ccaacttcta cattctggat gaacccacaa 
tgggccgtgc cctcaacaat ttcaggggtg 
ttatcaggct ggtgtgccgg gagttgtggg 
aaggaggatt tgaccagtac cgcgccctcc 
tctagggcca ccaggctgag gactcgccca 
ccaccatgta ggccaccact ccaggccgtg 
cccaaatgtc tctatccttt tgactggagc 
a 999ttggtg aggactggtc tcccgggggt 
agattccccc actgccccag ctctgactgg 
tctccccgcg tcaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aag 



<210> 17 
<211> 4312 
<212> DNA 

<213> Homo sapiens 



atttcagcca gcaccatgtg gagcagctgg 1740 
cacgcaagtt tcctgggcgg cctgaggagg 1800 
tctccggaga actggccatg cgtcctcttg i860 
tggcctttgc tcagatgact atgccctgcc 1920 
accacctgga catggagacc attgaggctc 1980 
gtgtgattct ggtgtcccac gatgagcgct 204 0 
tatgcgaagg aggcggcgtc acccgtgtgg 2100 
tccaggaaca gttccgccgc gaaggcttcc 2160 
ggacatggac tggtctctca gacccctggg 2220 
gacttccccc aacttgggga cagccttatt 2280 
atcttctgca caaccttggg agcccatcca 234 0 
gggggtctgg ggggtaccct ctggggttat 24 00 
accccaagtg gctgctatgt aaattaaatc 2460 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2520 

2533 



<220> - 

<223> 2797710 

<400> 17 

cttcgagccg ccacgtaatg ccacgtcccc 
ctgtttccgg gcttagaggg ctggagtggc 
aggagtgtgt agagtgcggg attgggggcc 
tctttttttt ccggaggggc cggtaaacct 
gcggggtaat ggggggtcag agtgccggac 
agctttgtga cagagtagct tcatctactt 
ctctcaaatc attatctaag aaataccgct 
ttattcatgt tttacaaaca gatcgttcag 
cactatataa tataatatct aatgaagaag 
agagtgaaga tttgggaagc caatttacag 
ctcttctgtt atctttattg gaggagtttg 
ttcttacttc tcttttaaaa caactagggc 
ctatgggtgt ttcaagattg atggacttac 
atggcgtctt actactgcag gcactaacaa 
cttttgaaaa tgctttcgag agactactgg 
gaggtatagt agttgaagat tgtttgattt 
ccaatcgaaa tttttttaaa gaaggctcat 
ttggagatga aaattctggc tggtctgcac 
agcttgttcg tgtattggta tctcccacca 
aggctatgtt ccagtgtggg ttattgcagc 
ttcctgctga tatcctgact gagaccataa 
aagtaaacca agactacttt gcatctgtaa 
ttgtagtact tctcatgtcc atggttaatg 
ttctctattg tttccagtgt ttcttgtata 
caacactttt accttctacc attgatgcaa 
tatgtggagg tttgttttct actgattcac 
cccatgcgtt gcaagaaaat gccacccaga 
caagtattgg caaccctcca gtttctttac 
gaagcaaaat acaaacaaga gttggattat 
gtcccattgc agtaacgcat tttcttcaca 
aaattgcaga aaatcttgga gaagaagagc 
tgggcatttc gatttatttc aatgataact 



gcgcatgcgc atcttggccg ctgctggcgg 60 
cgccgagttg gaggcggtgg tggcagcagt 120 
aggccctgcg gagggcgggg gaagttgtct 18 0 
ggtggctgaa cggcaagatg aatttcctcc 24 0 
cccagcacac agaagccgag acgattcaaa 300 
tattggatga tcgaagaaat gctgttcgtg 360 
tggaagtggg tatacaagct atggaacatc 420 
attctgaaat aataggttat gctttggaca 480 
aagaagaagt agaagaaaat tccacaagac 540 
aaattttcat taagcagcag gaaaatgtca 600 
atttccatgt: ccgctggcct ggtgtgaagc 660 
ctcaggtgca acaaattatt ttagtcagtc 720 
tagcggattc cagggaagtt atacgtaatg 780 
gaagcaatgg tgcaatccag aaaattgttg 840 
acattatttc agaggagggg aacagtgatg 900 
tgctccaaaa cttattaaaa aacaacaact 960 
atattcaacg tatgaaacct tggtttgaag 1020 
agaaagtgac caatctacat ctaatgctac 1080 
accctcctgg tgctaccagt agctgccaga 1140 
agctttgtac tatcctaatg gctactgggg 1200 
atactgtatc agaagttatt cgaggctgcc 1260 
atgcaccttc aaacccacca agaccggcaa 132 0 
aaaggcagcc atttgttttg cgctgtgctg 1380 
aaaaccaaaa aggacaagga gaaatcgtgt 1440 
caggtaattc agtttcagct ggccagttat 1500 
tttcaaactg gtgtgctgct gtggcccttg 1560 
aagaacagtt gctcagggtt caacttgcta 1620 
ttcaacagtg caccaatatt ctttcacagg 1680 
taatgttgct ttgtacctgg ctaagcaatt 1740 
attcagccaa tgttccattt cttacaggac 1800 
agttggtcca aggcttatgt gcccttttgt 1860 
cacttgagag ctacatgaaa gagaagctaa 1920 
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aacaactgat tgagaagagg attggcaaag agaatttcat agagaaacta ggatttatta 1980 
gcaaacatga gttgtattcc agagcatctc agaaacccca gccaaacttt cccagtccag 2040 
aatacatgat atttgatcat gagtttacga agctggtaaa agaacttgaa ggtgttataa 2100 
ctaaggctat ttataagtcc agtgaagaag ataaaaaaga agaagaggtg aaaaaaacat 2160 
tagaacagca tgacaatatt gtgactcact acaaaaatat gattcgagag caggatct-cc 2220 
aacttgagga attaaggcag caggtttcta cattaaaatg tcaaaatgaa cagctccaga 2280 
cggcagtcac acagcaagta tcacagatcc agcagcacaa agaccagtat aatcttctta 2340 
aaatacagct aggaaaagac aatcagcatc aaggttctta cagtgagggg gctcagatga 2400 
atggcattca gccagaagaa attggtagat tgcgagaaga gatagaagaa ttaaaacgta 2460 
atcaggaact tttacaaagc cagctgactg aaaaggactc tatgattgaa aatatgaaat 2S20 
cttcccaaac atctggcaca aatgaacagt cttcagcaat agtttcagct agagattctg 2580 
aacaagttgc agaattaaaa caggaactgg caactttaaa gtctcagtta aactcacaat 2640 
ctgtggagat caccaaacta cagacagaaa agcaggaact gttacagaaa acagaagcgt 2700 
ttgcaaaatc agttgaggta caaggagaga ccgagactat aatagccacc aaaactactg 2760 
atgtagaagg aagactgtca gcattattac aagagaccaa agagttaaag aatgaaatta 2620 
aagctctgtc tgaggaaaga actgccatta aagagcagct ggattcatct aatagtacca 2880 
ttgccatttt acaaactgag aaagacaaac tagagttgga aattacagat tctaaaaaag 2940 
aacaagatga tctcttggtg ctcttggccg atcaagatca gaaaatactg tcattgaaga 3000 
ataaactcaa ggatcttggt catccagttg aagaagagga tgaacttgaa tctggagacc 3060 
aagaggatga ggatgatgaa agtgaagatc ctggcaagga tctagatcat atctagtttt 3120 
caaatctcta ggaacaatag agattatatc ataactctga agatggactc taaactttgg 3180 
tttacctcca tggaaaataa agatttagaa accaggtgga aaacattcac tattaagact 3240 
attgatatat ttttgtaatg ttgccaccca tgttgaaaac cttaaaactg aatttatgta 33 00 
gaacacacat cttttattta aatccatctg aactgtccca aaatgtaatt tgcaaagagc 3360 
ttcaaacacc aaaaatatac cattttaagg catgtggtga caatgctatt tgccttagtt 3420 
cattcgttgt ttactttgca aaatttgtga cttttaagta ctaacacatt tgattgaatg 3480 
tgtattgaat tggaattatt atttctaaat tcagcatttg agtccttagt tgatgatttt 3540 
cttttaaaga aattagttta tttaaaagca agatagcctg ggttgggagt tagtgcatat 3600 
catcagccct ccatctaatg tgattgctaa gatgtgcagg ataggacagc ttagaatgag 3660 
tgccttgaca cctacatatg gctgcattcc atacctcagt gaagaagccg aaggatgctg 3720 
acacacagtg atgtggaggg cattccggag aaacttttgc aacagtgtat taatgtgatt 3780 
ggttttcttt ttgttttaag ctaaacatct aaaacaaatt tcaagttaac agttcataca 3840 
actcagattt agtattattg tttatcataa aacatattat gttcctagtt ttaaaaagac 3900 
ccaaagtatg attaatataa tttattggca tttttgctga ataggtttaa gtcagataat 3960 
agattttaaa aaagcaaatg aggcaatgtg tcaaaattta atgtttcata ataaaaatag 4020 
atacgttgac ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4080 
aaaaaaaaaa aaaaaaaaaa aagggggggc cgctctaggg gtcccaggtt taggtagggg 4140 
tgaatgggag gtcaaggccc ttctaagggg tcccctaatt tcatttcacg ggcgg-cggtt 4200 
ttaaaaggcc gtgactggga aacccctggg gttacccaat ttaatcgcct tggggaaaac 4260 
ccccttttgg caagcgggcg taatagcgaa ggggcccgca cggttcgcct tt 4312 



<210> 18 

<211> 2370 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> unsure 
<222> 2354 

<223> a or g or c or t, unknown, or other 

<220> - 
<223> 2914719 

<400> 18 

gtttctaccg caggcttaag gaggcttcgg gctcctggga tttctgtccg cgct-cctggc 60 
cctcgtcctt cgcgccagag caggttcgca aactcctcag acccttctgc toccggcogc 120 
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cgctttccgc cggggcgaga cccccaggtt caaaatgagc ctgtttggaa caacctcagg 180 
ttttggaacc agtgggacca gcatgtttgg cagtgcaact acagacaatc acaatcccat 240 
gaaggatatt gaagtaacat catctcctga tgatagcatt ggttgtctgt cttttagccc 300 
accaaccttg ccggggaact ttcttattgc aggatcatgg gctaatgatg ttcgctgctg 360 
ggaagttcaa gacagtggac agaccattcc aaaagcccag cagatgcaca ctgggcctgt 420 
gcttgatgtc tgctggagtg acgatgggag caaagtgttt acggcatcgt gtgataaaac 480 
tgccaaaatg tgggacctca gcagtaacca agcgatacag atcgcacagc atgatgctcc 54 0 
tgttaaaacc atccattgga tcaaagctcc aaactacagc tgtgtgatga ctgggagctg 600 
ggataagact ttaaagtttt gggatactcg atcgtcaaat cctatgatgg ttttgcaact 660 
ccctgaaagg tgttactgtg ctgacgtgat ataccccatg gctgtggtgg caactgcaga 720 
gaggggcctg attgtctatc agctagagaa tcaaccttct gaattcagga ggatagaatc 780 
tccactgaaa catcagcatc ggtgtgtggc tatttttaaa gacaaacaga acaagcctac 840 
tggttttgcc ctgggaagta tcgaggggag agttgctatt cactatatca accccccgaa 900 
ccccgccaaa gataacttca cctttaaatg tcatcgatct aatggaacca acacttcagc 960 
tcctcaggac atttatgcgg taaatggaat cgcgttccat cctgttcatg gcacccttgc 1020 
aactgtggga tctgatggta gattcagctt ctgggacaaa gatgccagaa caaaactaaa 1080 
aacttcggaa cagttagatc agcccatctc agcttgctgt ttcaatcaca atggaaacat 1140 
afcttgcatac gcttccagct acgactggtc aaagggacat gaattttata atccccagaa 1200 
aaaaaattac attttcctgc gtaatgcagc cgaagagcta aagcccagga ataagaagta 1260 
gtggctggag actctggctc agccagagtt gtttctctcc actctgcctc atctctgtac 1320 
gaatttgggt cccagccttg ttgggttgtc agccatggac atggatttca acccctggag 1380 
aaaacgatgt cattgttcag cagctgagag cccaggcgtc cgcggcgact tgccgtctct 1440 
ccattccact gcctgttgca gagtttttct gtaactaagg gggttgaggt tattgtagac 1500 
gttagattgc gggcaccgcc agggattttg cagcgcttca gtgtacgtgt tagagaatat 1560 
tggaaaagcg tctgtgagcc ccgtgctgta ttttgtaata aagtcttttg cagattgctt 1620 
cccgagcttc ctttgtcctt ttctcccctt gcccaccccg taacctcagg aacatgcgtc 1680 
ctgcccagca tcagcgtggg gttttgagtt gagatttcag acaccctctg ggaaatgcgg 1740 
caaccttagg ggaaagggag tccccagctg cgctcacttc tgctgcgtgg aacggcagcc 1800 
tctgtgagcc ctggtgggca gagtttgaat gtgtttttcc ttgcttccct cattcccatc 1860 
ttcaaaatcc ccagtgcttt ctggccttgc tgctcagatt tccgagtgac tcaaatgggg 1920 
actgttactt gtgctgggtg acaggccatt tgtgggtaac ctcctaaggc ccaagtgggt 1980 
gacacttgcg tgactttcaa gttagaaccc aagccccctg catgggaatt gccctgaact 2040 
cttacccacc ccggtcccct gcatgggaat tgccctgaac cctcagtagt ggttgtttgg 2100 
ctgttctttt gtattttgtg taatttaaaa ttcctggttg gtaagtattc aaaatgaaat 2160 
tcagctgggc tgagaaaaag tgtgactttt gggtctgtca ctgtatttct cacctgtgat 2220 
ctcaaatgct tcttaggcct ttctgtttgg actaatgtgt gaagtctgac ttgctgagtg 2280 
taaaattcta gtatcgatag tgttgtaaga tgtgtttgcc tactcataaa aaacaatgaa 2340 
aataaaattt catnctgcaa tgttagttgg 2370 
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